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TITLE: A method of producing a low molecular weight organic compound in a cell. 
FIELD OF THE INVENTION: 

5 The present invention relates to a method of producing a low molecular weight organic 
compound (e.g. a plant or bacteria secondary metabolite) in increased yields involving use of a 
cell (e.g. a plant or microorganism cell), which comprises a gene involved in the biosynthesis 
pathway leading to a low molecular weight organic aglycon compound and a glycosyltransferase 
gene capable of glycosylating the produced aglycon. 

10 

BACKGROUND OF THE INVENTION: 

Plants and microbes synthesize a large number of natural substances, in particular secondary 
metabolites, with diverse and generally unclear function. In contrast to the primary metabolites 

15 (e.g. amino acids, sugars, fatty acids), which are involved in fundamental functions like 
metabolism, growth, maintenance and survival, secondary metabolites are not required for 
fundamental functions. Many secondary metabolites from plants are known to act as repellants 
or natural pesticides in defense against herbivore animals or as sexual attractants to pollinating 
insects (Grisebach, 1988, In: European Conference on Biotechnology, Scientific, technical and 

20 industrial challenges, Verona, Italy, 7-8 November 1988, pages 23-27), whereas fungal 
secondary metabolites often act as phytotoxins (Osboum, 2001, Proceedings of the National 
Academy of the USA 98: 14187-14188). 

Various secondary metabolites from more than 100 different plant species have been shown to 
25 exert antimicrobial activity (Cowan, 1999, Clinical Microbiology Reviews 12: 564-582) and a 
large number of secondary metabolites from common food plants are not only responsible for the 
taste and color but are also believed to have health promoting activities (Eastwood, 2001, 
Quarterly Journal of Medicine 94: 45-48; Drewnowski & Gomez-Carneros, 2000, American 
Journal of Clinical Nuuition 72: 1424-1435 ). Accordingly, these natural substances are 
30 economically important in such different fields as drugs, food additives, fragrances, pigments, 
and pesticides. 
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Secondary metabolites often accumulate in small quantities and sometimes only in specialized 
cells. Hence their extraction can be difficult and inefficient In spite of the progress in organic 
chemical synthesis, a large number of these metabolites have such complex structures that they 
are virtually impossible to synthesize at economic levels. Moreover, the natural product is 
5 generally more acceptable to consumers than an artificially produced one. Consequently, 
industrial application of these substances and their functional analogues often relies on natural 
extraction from plants. 

Secondary metabolites may generally be structurally qualified as low molecular weight organic 
10 compounds. 

Industrial production of secondary metabolites or other natural and non natural low molecular 
weight organic compounds can be facilitated by abiotechnological approach. By transformation 
of genes involved in Ihe biosynthesis of a desired natural product, plants or microbes can e.g. be 
1 5 manipulated to produce a compound not previously present in the plant or organism 

Glycosyltransferase may be defined as an enzyme which transfers residues of sugars (galactose, 
xylose, rhamnose, glucose, etc) to acceptor molecules. Acceptor molecules maybe other sugars, 
proteins, lipids and other organic substrates. The acceptor molecule may be termed an aglycon 
20 (aglucone if sugar is glucose). An aglycon may be defined as the non-carbohydrate part of a 
glycoside. A glycoside may be defined as an organic molecule with a glycosyl group (organic 
chemical group derived from a sugar or starch molecule) connected to it by way of e.g. an 
intervening oxygen atom. 

25 These glycosylated molecules take part in diverse metabolic pathways and processes. The 
transfer of a glycosyl moiety can alter the acceptor's bioactivity, solubility and transport 
properties e.g. within a plant or microbial cell and throughout the plant 

The art describes a number of glycosyltransferases that can glycosylate compounds such as 
30 secondary metabolites from e.g. plants and fungi (Paquette, S. et al, Phytochemistry 62 (2003) 
399-413). 
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WO01/07631, WO01/40491 and (Arend, J et al., Biotech. & Bioeng (2001) 78:126-131) 
describe that at least some of these glycosyltransferases are capable of glycosylating a number of 
different structurally related secondary metabolites and other structurally related low molecular 
weight organic compounds. 

5 

Accordingly, the skilled person has at his disposal a number of different glycosyltransferases 
capable of glycosylating numerous different secondary metabolites and other structurally related 
low molecular weight organic compounds. 

10 Tattersall, DB et al, Science (2001) 293:1826-8 describes that the entire pathway for synthesis 
of the tyrosine-derived cyanogenic glucoside dhurrin [a seconday metabolite] has been 
transferred from the plant Sorghum bicolor to the plant Arabidopsis thaliana. The entire 
pathway for synthesis included two genes involved in the biosynthesis pathway (CYP79A1 and 
CYP71E1) and a glucosyltransferase (sbHMNGT) capable of glucosylating the last intermediate 

1 5 (p-hydroxymandelonitrile) to get the glucoside dhurrin (see Figure 1 herein). It was 

demonstrated that the transgenic Arabidopsis thaliana plant was capable of producing 4 mg of 
dhurrin per gram of fresh weight. 

Arend, J et al., Biotech. & Bioeng (2001) 78:126-131 and WO01/07631 describes cloning of a 
20 glucosyltransferase from the plant Rauvolfia serpentina. The cloned glucosyltransferase was 
inserted into E. coli bacteria. When the aglucones hydroquinone, vanillin and p- 
hydroxyacetophenone were added to the medium of cultivated cells of the engineered B. coli, the 
corresponding glucosides, arbutin, vanillin-D-glucoside and picein were synthesized They also 
were released from the cells into the surrounding medium 

25 

Moehs, CP et al, Plant Journal (1997) 1 1 :227-236 describes that a cDNA encoding a solanidine 
glucosyltransferase (SGT) was isolated from potato. The cDNA was selected from a yeast 
expression library using a positive selection based on the higher toxicity of steroidal alkaloid 
aglycons relatively to their corresponding glycosylated forms. The activity of the cloned SGT 
30 was tested in an in vitro assay based on isolated recombinant produced SGT. 
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US6372461 describes a method for making the secondary metabolite vanillin by use of an E. coli 
cell where there has been introduced genes involving in the biosynthesis pathway starting from 
glucose and leading to vanillic acid. The recombinant E. coli can produce vanillic acid when 
cultured in a medium comprising glucose. The produced vanillic acid is recovered from the 
5 fermentation broth and reduced to vanillin with aryl-aldehyde dehydrogenase. 

SUMMARY OF THE INVENTION: 

The problem to be solved by the present invention is to provide a new method of producing in 
10 particular low molecular weight organic compounds of interest, wherein the method provides the 
possibility of obtaining the compound in higher yields. 

Among other things, the solution is based on mat Ihe present inventors have investigated and 
compared the following two cultivated microorganisms: 
15 (i) amicroorgamsmcomprismgagenemvolvedmmebiosynmesispam^ a 

low molecular weight aglycon compound; and 
(n) the same microorganism but where there is also introduced a glycosyltransferase gene 
capable of glycosylating the produced aglycon to get the associated glycosylated 
form of the aglycon. 

20 

The inventors found that the microorganism with the glycosyltransferase during culture 
fermentation is capable of producing higher amounts of the glycosylated form of the aglycon as 
compared to the amounts of the corresponding aglycon produced by the microorganism without 
the glycosyltransferase. See working examples herein for an illustrative example where an E. coli 
25 cell of type (ii) above produces higher amounts of vanillin glucoside as compared to the amounts 
of the corresponding vanillin aglucone produced in a corresponding E. coli without the 
glycosyltransferase (E. coli cell of type (i) above). 

Accordingly, a microorganism cell of the type (ii) above may then be used to obtain the 
30 glycosylated form of the corresponding aglycon in high amounts. However, it may also be used 
in a process to make the aglycon in higher amounts simply by e.g. first making the glycosylated 
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form of the aglycon, recovering it and deglycosylate it according to standard protocols (e.g. 
enzymatically by use of a P-glucosidase or by adequate chemical hydrolysis.) 

Accordingly, a first aspect of the invention relates to a method of producing a low molecular 
5 weight organic compound comprising following steps: 

a) fermenting a microorganism cell or a filamentous fungi cell, which comprises a gene 
encoding a product involved in the biosynthetic pathway leading to a low molecular 
weight organic aglycon compound and a heterologous glycosyltransferase gene encoding 
a glycosyltransferase capable of glycosylating the produced aglycon, in a suitable 

10 medium, wherein the cell produces the aglycon and the corresponding glycosylated form 

of the aglycon; and 

b) recovering the glycosylated form of the aglycon compound; 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
weight from 50 to 3000 g/mol, and 
1 5 (ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 

a sugar to the aglycon compound, wherein the sugar is a sugar selected from the 
group consisting of galactose, glucosamine, N-acetylglucosamine, xylose, 
glucuronic acid, rhamnose and glucose. 

20 A second aspect of the invention relates to a microorganism cell or a filamentous fungi cell that 
comprises a gene encoding a product involved in the biosynthetic pathway leading to a low 
molecular weight organic aglycon compound and a heterologous glycosyltransferase gene 
encoding a glycosyltransferase capable of glycosylating the produced aglycon, wherein the cell, 
when fermented in a suitable medium, produces the aglycon and the corresponding glycosylated 

25 form of the aglycon, and wherein the glycosyltransferase is a glycosyltransferase capable of 
conjugating a sugar to the aglycon compound, wherein the sugar is a sugar selected from the 
group consisting of galactose, glucosamine, N-acetylglucosamine, xylose, glucuronic acid, 
rhamnose and glucose. 

30 As said above, the basic principle behind the method described above may be used to produce an 
aglycon of interest. 
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Accordingly, a third aspect of the invention relates to a method of producing a low molecular 
weight organic aglycon compound comprising following steps: 

a) growing a cell, which comprises a gene encoding a product involved in the 
biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 

5 glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 

produced aglycon, under suitable conditions wherein the cell produces the aglycon and 
the associated glycosylated form of the aglycon; 

b) deglycosylatdng the glycosylated form of the aglycon; and 

c) recovering the aglycon compound; 

1 0 (i) wherein the low molecular weight organic aglycon compound has a molecular 

weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
a sugar to the aglycon compound 

1 5 An advantage of using a glycosyltransferase in a method as described herein may further be to 
use the different specificities of known glycosyltransferases. For instance, it is known that some 
glycosyltransferases are enantiomer specific (see e.g. Jones, P et al, J. of Biological Chemistry 
(1999), 274:35483-35491 and WO03/023035). Consequently, if one for instance wants to make 
a specific enantiomer for e.g. an aglycon then one could choose to use such an enantiomer 

20 specific glycosyltransferase. 

A fourth aspect of the invention relates to a method for selecting a cell with increased 
production of a glycosylated form of a low molecular weight organic aglycon compound 
comprising following steps: 
25 a) growing a cell, which comprises a gene encoding a product involved in the 

biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 
glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 
produced aglycon, under suitable conditions wherein the cell produces the aglycon and 
the corresponding glycosylated form of the aglycon; 
30 b) treating the cell in a way that changes the expression level of at least one gene 

involved in the biosynthesis pathway leading to a low molecular weight organic aglycon 
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and/or the glycosyltransferase gene capable of glycosylating the produced aglycon in 
order to make a library of cells with different expression levels of the genes; and 
c) selecting a cell that produces a higher amount of the glycosylated form of the aglycon 
as compared to the cell of step a); 
5 (i) wherein the low molecular weight organic aglycon compound has a molecular 

weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
a sugar to the aglycon compound. 

10 Example 8 herein describes how this method is used to make an Arabidopsis thaliana plant 
capable of producing increased mg of the glucoside dhurrin per gram of fresh weight. The 
starting cell of step a) is the Arabidopsis thaliana transgenic cell described in Tattersall, DB et 
al, Science (2001) 293:1826-8. As explained above the Arabidopsis thaliana transgenic cell 
comprises the entire pathway for synthesis of the cyanogenic glucoside dhurrin. It was 

1 5 demonstrated that the transgenic Arabidopsis thaliana plant was capable of producing 4 mg of 
dhurrin per gram of fresh weight After performing the selecting method as described in example 
8 an Arabidopsis thaliana transgenic cell is selected in step c) that produces more than 6 mg of 
dhurrin per gram of fresh weight. 

20 Without being limited to theory, it is believed to be the first time that a transgenic plant has been 
provided that is capable of producing more than 4 mg per gram of fresh weight of a glycosylated 
form of a low molecular weight organic aglycon compound. 

Accordingly, a fiflh aspect of the invention relates to a transgenic plant capable of producing 
25 more than 5 mg per gram of fresh weight of a glycosylated form of a low molecular weight 
organic aglycon compound, 

(i) wherein the low molecular weight organic aglycon compound has a molecular weight 
from 50 to 3000. 



30 DEFINITIONS: 
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Prior to a discussion of the detailed embodiments of the invention is provided a definition of 
specific terms related to the main aspects of the invention. 

The term "aglycon" denotes non-carbohydrate part of the corresponding glycosylated form of 
5 the aglycon. It may also be defined as an acceptor compound capable of being conjugated to a 
sugar. In a number of relevant examples, the aglycon is an alcohol with a hydroxy group suitable 
for being glycosylated An example of this is p-hydroxymandelonitrile (see figure 1) which has a 
hydroxy group that can be conjugated (glycosylated) with glucose to get dhurrin. (dhuirin is here 
the corresponding glycosylated form of the aglycon j?-hydroxymandelonitrile). In this example, 
1 0 wherein the sugar is glucose the aglycon may be termed aglucone. Further, when the sugar is 
glucose the term glucosylated may be used instead of glycosylated 

An aglycon may also be glycosylated at different group than a hydroxy group, in particular at 
other nucleophilic groups such as a carboxylic acid, SH-, nitrogen, amine or C-C group. 

1 5 The term "gene encoding a product involved in the biosynthesis pathway leading to a low 
molecular weight aglycon compound" should be understood according to the art as a gene 
encoding a product involved in the biosynthesis pathway leading to a low molecular weight 
aglycon compound. The gene encoded product is normally a polypeptide. However the product 
may also for instance be a RNA molecule affecting the expression of a gene. The encoded 

20 product of the gene may be directly involved in the biosynthesis pathway or indirectly via e.g. 
other precursors or intermediated The important issue, independently of the precise mechanism 
behind it, is that the gene makes it possible for the cell to synthesize the aglycon compound of 
interest as described herein. The art describes numerous suitable examples of such genes. Just for 
illustration one example could be CYP71E1 from Sorghum bicolor that starting from p- 

25 hydroxyphenylacetonitrile is involved the biosynthesis pathway leading to a low molecular 

weight aglycon compound /^hydroxymandelonitrile (see Figure 1 herein and Tattersall, DB et al, 
Science (2001) 293:1826-8). Further examples are the genes involved in the biosynthesis 
pathway leading to a low molecular weight aglycon compound vanillin as described in working 
examples herein. 

30 

The term "glycoside" denotes a compound which on hydrolysis gives a sugar and a non-sugar 
(aglycon) residue, e.g. glucosides give glucose, galactosides give galactose. 
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The term "glycosyltransferase" denotes a glycosyltransferase capable of conjugating a sugar to 
an aglycon as described herein. The sugar may e.g. be D and L isomers of galactose, 
glucosamine, xylose, glucuronic acid, rhamnose or glucose. Alternatively the sugar may be a 
5 carbohydrate derivative such as e.g. inositol, D-olivose, rhodinose and etc available as nucleotide 
dephosphates. Further the sugar may for instance be e.g. a monosaccharide, a disaccharide or a 
trisaccharide. If the sugar is glucose the glycosyltransferase may be termed a 
glucosyltransferase. 

10 The term "growing a cell" in relation to step a) of the third aspect should be understood broadly 
in the sense of growing a cell under suitable conditions (temperature, nutrients etc.) that allows 
growth of the cell. If the cell is e.g. a plant cell this means e.g. growth of the plant cell under 
conditions where e.g. a mature plant is obtained If the cell is a microorganism this could e.g. be 
fermenting of the microorganism cell in a suitable medium where the microorganism is capable 

15 of growing. 

The term "recovering" in relation to "recovering the glycosylated form of the aglycon 
compound" of step b) of the first aspect of the invention and "recovering the aglycon 
compound" of step c) of third aspect of the invention should be understood broadly in the sense 

20 that the compound is recovered from the cell or from e.g. the supernatant of the medium where 
the cell for instance is fermented in order to get the compound in a higher purity than before the 
recovery step. The recovery step may include more or less detailed purification steps. Preferably 
the compound is at some point after the recovery step present in a composition where the 
composition comprises at least 4% (w/w) of the compound, more preferably at least 10% (w/w) 

25 of the compound, even more preferably at least 20% (w/w) of the compound and most 
preferably at least 50% (w/w) of the compound. The skilled person is aware of suitable 
purification protocols (e.g. by using adequate purification columns) to obtain the desired purity. 
Preferably after recovering there is recovered at least 10 mg compound, more preferably, there is 
recovered at least 1 g compound, even more preferably there is recovered at least 10 g 

30 compound, and most preferably there is recovered at least 500 g compound Preferably there is 
recovered from 10 mg to 100 kg compound. 
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Embodiments) of the present invention is described below, by way of example(s) only 
DRAWINGS: 

5 Figure 1 : Shows the pathway for synthesis of the tyrosine-derived cyanogenic glucoside dhuirin 
(a seconday metabolite). The pathway for synthesis included two genes involved in the 
biosynthesis pathway (CYP79A1 and CYP71E1) and a glucosyltransferase (sbHMNGT) capable 
of glucosylating the last intermediate (p-hydroxymandelonitrile) to get the glucoside dhurrin. 

10 DETAILED DESCRIPTION OF THE INVENTION: 

Cell 

The cell suitable to growth as specified under step a) of the method of the third aspect of the 
1 5 invention may be any suitable cell such as any eukaryotic or prokaryotic cell. Preferably the cell 
is a cell selected from the group consisting of a plant cell, a filamentous fungal cell and a 
microorganism cell. 

Plants which include a plant cell according to the invention are also provided as axe seeds 
20 produced by said plants. 

In a preferred embodiment of the invention said plant is selected from: corn (Zea. mays), canola 
(Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale 
cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (helianthus annuas), wheat 

25 (Tritium aestivum and other species), Triticale, Rye (Secale) soybean (Glycine max), tobacco 
(Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton 
(Gossypium hirsutum), sweet potato (Impomoea batatus), cassava (Manihot esculenta), coffee 
(Cofea spp.), coconut (Cocos nucifera), pineapple (Anana comosus), citrus (Citrus spp.) cocoa 
(Theobroma cacao), tea (Camellia senensis), banana (Musa spp.), avacado (Persea americana), 

30 fig (Ficus casica), guava (Psidium guajava), mango (Mangifer indica), olive (Olea europaea), 
papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia 
intergrifolia), almond (Primus amygdalus), apple (Malus spp), Pear (Pyrus spp), plum and 
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cheny Iree (Prunus spp), Ribes (currant etc.), Vitis, Jerusalem artichoke (Helianthemum spp), 
non-cereal grasses (Grass family), sugar and fodder beets (Beta vulgaris), chicory, oats, barley, 
vegetables, and ornamentals. 

5 Preferably, plants of the present invention are crop plants (for example, cereals and pulses, 
maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea, and other root, tuber 
or seed crops. Important seed crops are oil-seed rape, sugar beet, maize, sunflower, soybean, 
and sorghum or plants grown for their use as spices or pharmaceutical products (Mentha spp, 
clove, Artemesia spp, Thymus spp, Lavendula spp, Allium spp., Hypericum, Catharanthus spp, 

10 Vinca spp, Papaver spp., Digitalis spp, Rawolfia spp., Vanilla spp., Petrusilium spp., Eucalyptus, 
tea tree, Picea spp, Pinus spp, Abies spp, Juniperus spp,. Horticultural plants to which the 
present invention may be applied may include lettuce, endive, and vegetable brassicas including 
cabbage, broccoli, and cauliflower, and carnations and geraniums. 

1 5 The present invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, 
tomato, pepper, Chrysanthemum. 

Grain plants that provide seeds of interest include od-seed plants and leguminous plants. Seeds 
of interest include grain seeds, such as com, wheat, barley, nee, sorghum, rye, etc. Oil-seed 
20 plants include cotton soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. 
Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, 
garden beans, cowpea, mungbean, lima bean, fava been, lentils, chickpea. 

In a further preferred embodiment of the invention said plant is selected from the following 
25 group: maize; tobacco; oil seed rape; potato; soybean. 

The whole genome of Arabidopsis thaliana plant has been sequenced (Paquette, S. et al, 
Phytochemistry 62 (2003) 399-413). Consequenfly, very detailed knowledge is available for this 
plant and it may therefore be a preferred plant cell to work with. 

30 

Accordingly, a very preferred plant cell is an Arabidopsis cell and in particular an Arabidopsis 
thaliana cell. 
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Filamentous fungi includes all filamentous forms of the subdivision Eumycota and Oomycota (as 
defined by Hawksworth et aL, 1995, supra). The filamentous fungi are characterized by a. 
vegetative mycelium composed of chitin, cellulose, glucan, chitosan, mannan, and other complex 
5 polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately 
aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding 
of a unicellular thallus and carbon catabolism may be fermentative. 

In a more preferred embodiment, the filamentous fungal cell is a cell of a species of, but not 
10 limited to, Acremonium, Aspergillus, Fusariura, Humicola, Mucor, Myceliophthora, Neurospora, 
Penicillium, Thielavia, Tolypocladium, and Trichoderma or a teleomorph or synonym thereof. 

A preferred microorganism cell suitable to be used in a method as described herein is a 
microorganism cell selected from the group consisting of a yeast cell and prokaryotic cell. 

15 

A preferred yeast cell is selected from the group consisting of Saccharomyces spp e.g. 
Saccharomyces cerevisiae; and Pichia spp. 

In a method as described herein a very preferred cell is a prokaryotic cell. A preferred 
20 prokaryotic cell is selected from the group consisting of Bacillus, Streptomyces, 
Corynebacterium, Pseudomonas and in particular an E. coli cell. 

A preferred Bacillus cell is B. subtilis, B. amyloliquefaciens or B. licheniformis. 

25 A preferred cell is a cell without active genes encoding a beta-glycosidase. Further, it is preferred 
that the cell comprises a permease or other transport protein enabling the cell to release or 
secrete the glycoside to the medium or to another internal compartment than the one where it is 
glycosylated. 

30 Transformation of suitable DNA containing vectors into the cells described above is routine 
work for the skilled person. Suitable vectors can be constructed, containing appropriate 
regulatory sequences, including promoter sequences, terminator fragments, polyadenylation 
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sequences, enhancer sequences, marker genes and other sequences as appropriate. For further 
details see, for example Sambrook et al. (1989) Molecular cloning: A laboratory manual, Cold 
Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current protocols in 
Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R., and Cutting, S. M. (eds.) 
5 "Molecular Biological Methods for Bacillus". John Wiley and Sons, 1990). 

In relation to examples of suitable plant cell vectors and suitable plant transformation techniques 
reference is made to WO02/103022. In relation to filamentous fungi cells reference is made to 
EP869167. 

10 Preferably, the cell is a cell wherein the cell expresses a heterologous glycosyltransferase gene as 
described herein, in particular a microorganism cell or a filamentous fungi cell that expresses a 
heterologous glycosyltransferase gene as described herein. 

hi this connection, the term "heterologous glycosyltransferase gene" should be understood 
1 5 according to the art as a glycosyltransferase gene that have been introduced into a cell that, 
before the introduction, did not express the glycosyltransferase gene. 

Glycosyltransferase: 

20 As described above, the art describes a number of glycosyltransferases that can glycosylate a low 
molecular weight organic compound of interest such as secondary metabolites from e.g. plants 
and fungi. Based on DNA sequence homology of the sequenced genome of the plant 
Arabidopsis thaliana it is believed to contain around 100 different glycosyltransferases. These 
and numerous others have been analyzed in Paquette, S. et al, Phytochemistry 62 (2003) 399- 

25 413. Figure 1 of this article is a so-called multiorganism tree providing names of numerous 
suitable glycosyltransferases. 

WO01/07631, WO01/40491 and (Arend, J et al., Biotech. & Bioeng (2001) 78:126-131) 
describe that at least some of these glycosyltransferases are capable of glycosylating a number of 
30 different structurally related secondary metabolites and other structurally related low molecular 
weight organic compounds. 
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Accordingly, the skilled person has at his disposal a number of different glycosyltransferases 
capable of glycosylating numerous different secondary metabolites and other low molecular 
weight organic compounds. This feet is described in further details below by way of some 
illustrative specific examples. 

5. 

Although glucosyltransferases are generally thought to be relatively substrate specific it has been 
shown that other substrates can be processed at significant levels if they are presented to the 
enzyme. It has been shown that expression of the sorghum CYP79A1 in Arabidopsis resulted in 
the production of large amounts p-hydroxybenrylglucosinolate, which is not a naturally occuring 
10 glucosinolate (S-glucoside) of Arabidopsis (Bak et al, 2000, Plant Physiology 123: 1437-1448). 
The new glucosinolate is produced from p-hydroxyphenylacetaldoxime, which is formed from 
tyrosine by CYP79A1 and subsequently channeled into the pre-existing glucosinolate 
biosynthetic pathway and to the associated glucosyltransferases. 

1 5 Moreover, a glucosyl transferase, arbutin synthase, isolated from the medicinal plant Rauvolfia 
serpentina was shown to have considerable activity to other substrates than the native 
hydroquinone (Arend et al, 2001, Biotechnology and Bioengineering 76: 126-131). When some 
of these substrates, like vanillin or p-hydroxyacetophenone, were fed to cultures of E. coli that 
had been engineered to express the plant glucosyl transferase the substances were taken up and 

20 converted to their corresponding glucosides, vanillin-B-D-ghicoside and picein. These two 
products, both of which are of commercial interest, were subsequently released from the 
bacteria. 

These examples show that glucosyl transferases can act on substrates that are normally not 
25 present in the metabolic pathway and that this can lead to the formation of novel glucosides of 
natural products. 

As said above the skilled person has at his disposal a number of different glycosyltransferases 
capable of glycosylating numerous different secondary metabolites and other low molecular 
30 weight organic compounds. Further, he may by use of known routine methods easily identify 
further suitable glycosyltransferases capable of glycosylating specific low molecular weight 
organic compounds of interest. Below are given some examples of such suitable methods. 
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The numerous known DNA glycosyltransferase sequences may be used to make appropriate 
PCR primers to clone new glycosyltransferases. This could e.g. be done by a method comprising 
following steps: 

5 (a) preparing a cDNA library from a cell (e.g. plant tissue cell) expressing glucosyltransferases, 

(b) using relevant known DNA glycosyltransferase sequences to make appropriate PCR primers 
to amplify part of the glucosyltransferase cDNA from the cDNA library, 

(c) using the DNA obtained in steps (b) as a probe to screen the DNA library prepared from cell 
expressing glucosyltransferases, and 

1 0 (e) identifying and purifying vector DNA comprising an open reading frame encoding a 
glucosyltransferase of interest. 

Alternatively, a suitable glucosyltransferase may be identified by use of a so-called expression 
cloning strategy. A suitable expression cloning strategy could be a method comprising following 
15 steps: 

(a) preparing a cDNA library from a cell (e.g. plant tissue cell) expressing glucosyltransferases; 

(b) introducing the cDNA library into expression cells by use of an expressing vector that gives 
expression (transcription and translation) of the cDNA in a growing expression cell of interest; 

(c) growing the cell in a medium comprising a toxic amount of a low molecular weight organic 
20 aglycon compounds of interest (i.e. only specific individual cells comprising an expressed 

glucosyltransferase capable of glycosylating the aglycon of interest will be able to grow); 

(d) isolating one or more of the growing cells and identify one that expresses the 
glucosyltransferase of interest. 

25 As described in Moehs, CP et al, Plant Journal (1997) 1 1 :227-236 this expression cloning 
strategy may be done by use of a yeast cell. The teaching of the present invention demonstrates 
that the glycosylated form of the aglycon is less toxic to bacteria as compared to the 
corresponding aglycon as such. Consequently, this expression cloning strategy may be done by 
use of bacterial cells such as E. coli. 

30 

Accordingly, a separate aspect of the invention relates to expression cloning method for 
obtaining a glucosyltransferase of interest comprising following steps: 
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(a) preparing a cDNA library from a cell (e.g. plant tissue cell) expressing glucosyltransferases; 

(b) introducing the cDNA library into bacterial expression cells by use of an expression vector 
that gives expression (transcription and translation) of the cDNA in a growing expression cell of 
interest; 

5 (c) growing the cell in a medium comprising a toxic amount of a low molecular weight organic 
aglycon compounds of interest (i.e. only specific individual cells comprising an expressed 
glucosyltransferase capable of glycosylating the aglycon of interest will be able to grow); 
(d) isolating one or more of the growing cells and identify one that expresses the 
glucosyltransferase of interest. 

10 

Preferably, the bacterial expression cells of step (b) are E. coli cells. 

An advantage of an expression cloning strategy as described above it that one directly gets a 
glucosyltransferase capable of glycosylating the aglycon of interest 

15 

Preferably, the glycosyltransferase is a capable of conjugating a sugar to the aglycon compound, 
wherein the sugar is a sugar selected from the group consisting of galactose, glucosamine, N- 
acetylglucosamine, xylose, glucuronic acid, rhamnose and glucose. 

20 In a further preferred embodiment, the glycosyltransferase is a UDPG-glycosyltransferase 
(UGT). UGTs have been identified in plants, animals, ftingi, bacteria and viruses. These 
glycosyltransferases are characterized by utilization of UDP -activated sugar moeties as the donor 
molecule and contain a conserved UGT-defining sequence motif near the C-terminus. See 
(Paquette, S. et al, Phytochemistry 62 (2003) 399-413) for further details in relation to the 

25 definition of this UDPG-glycosyltransferase fonily. 

Preferably, the glycosyltransferase is a glycosyltransferase that conjugates a monosaccharide or a 
disaccharide sugar to an aglycon as described herein. Most preferably glycosyltransferase is a 
glycosyltransferase that conjugates a monosaccharide sugar to an aglycon as described herein. 

30 

In a preferred embodiment, the glycosyltransferase is a glucosyltransferase. 
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Below are described examples of some suitable glycosyltransferases. 

WO01/40491 describes cloning of a glycosyltransferase from the plant Sorghum bicolor named 
sbHMNGT. This and the related homologous glycosyltransferases as described in WO01/40491 
5 are enable of glycosylating at least following low molecular weight organic aglycon compounds 
( S eeTable3ofWO01/40491): 
Cyanohydrins 

I) mandelonitrile 
p-hydroxymandelonitrile 

10 3) acetone cyanohydrin 
benzyl derivatives 

4) hydroquinone 

5) benzyl alcohol 

6) p-hydroxybenzyl alcohol 
15 7) benzoic acid 

8) p-hydroxybenzoic acid 

9) p-hydroxybenzaldehyde 

10) gentisic acid 

II) caffeicacid 

20 1 2) 2-hydroxy cinnamic acid 

13) resveratrol (stilbene) 

14) salicylic acid 

15) p-hydroxymandelic acid 

16) vanillic acid 
25 17) vanillin 

18) 2-hydroxy methoxybenzylalcohol 
flavonoids 

19) quercetin (flavonol) 

20) cyanidin (anthocyanidin) 
30 21)biochaninA(isoflavone) 

22) naringenin (flavanone) 

23) apigenin (flavone) 
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hexanol derivatives 

24) l-hexanol 

25) trans hexen-l-ol 

26) cis hexen-l-ol 

5 27) 3 -methyl hexen-l-ol 

28) 3 -methyl hexen-l-ol 

29) indole acetic acid (plant hormone) 

30) geraniol (monoterpenoid) 

3 1) tomatidine (alkaloid) 
10 32)nerol 

33) p-citronellol 

Arend, J et al., Biotech. & Bioeng (2001) 78:126-131 describes cloning of a glycosyltransferase 
from the plant Rauvolfia serpentina. It is capable of glycosylating at least following low 
1 5 molecular weight organic phenolic aglycon compounds (see table 1): 

Hydroquinone, Vanillin, Saligerin, Resorcinol, Thymol, Phenol, Methylvanillin, p- 
Hydroxyacetophenone, Vanillic acid, p-Methoxyphenol, 3,4-Dimethoxyphenol, Coniferyl 
alcohol, o-Coumaric Acid, p-Coumaric Adic. 

20 WO01/59140 describes a glycosyltransferase from the plant Arabidopsis thaliana. It is capable 
of glycosylating at least following low molecular weight organic aglycon compounds: caffeic 
acid, luteolin, quercitin-, catechinl-, syadinin. 

WO02/1 03022 describes a glycosyltransferase from the plant Arabidopsis thaliana. It is enable 
25 of glycosylating at least following low molecular weight organic aglycon compounds: 
Benzoate substrates, in particular p-hydroxybenzoic acid. 

WO03/02035: describes a glycosyltransferase from the plant Arabidopsis thaliana. It is capable 
of glycosylating at least following low molecular weight organic aglycon phytohonnone 
30 compound: Abscisic acid 
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Below are described suitable assays to measure the activity of a glycosyltransferase of interest 
The assays are described for a glucosyltransferase. However when sugar is not glucose, one just 
has to use the adequate sugar (e.g. galactose) in stead of the glucose. 

5 The ability of a glucosyltransferase to conjugate an aglycon of interest to glucose can for 
example be determined in an assay comprising the following steps. 

a) Incubation of a reaction mixture comprising 14 C_UDP-glucose, aglycon and 
UDPglucose:aglycon-glucosyltransferase at 30°C between 2 minutes and 2 hours 

b) terminating the reaction, and 

1 0 c) chemical identification and quantification of the glucoside produced 

Typically the reaction mixture has a volume of 5 to 2000 pi, but preferably 20 pi and includes 
10-200 mM TrisHCI (pH 7.9); 1-5 \M 14 4C-UDP-glucose (about 11.0 GBqmmol-1); 0-300 jiM 
UDP-glucose; 0-20 mM aglycone; 25 mM y-gluconolactone; 0-2 ng/yl BSA and 0-10 ng/pl 
1 5 UDP-ghicose:aglycon-glucosyltxansferase. (3-glucosidase inhibitors other than y-gluconolactone 
and protein stabilizers other than BSA may be included as appropriate. One possibility to 
terminate the reaction is to acidify the reaction mixture for example by adding 1/10 volume of 
10% acetic acid. 

20 Chemical identification and quantification of the glucoside formed in the reaction mixture may be 
achieved using a variety of methodologies including NMR spectroscopy, TLC analysis, HPLC 
analyses or GLC analysis in proper combinations with mass spectrometric analysis of the 
glucoside. 

Reaction mixtures for analysis by NIVIR spectroscopy usually have a total volume of 0.5-1 ml, 
25 are incubated for 2 hours and include 0-10mM aglycon, e.g.2 mM p-hydroxy-mandelonitrile or 
6.5 mM geraniol, 3 mM UDP-glucose, 2.5 jxg gluceryltransferease, and 0.5 mg BSA. Glucosides 
are extracted for example with ethyl acetate and lyophillized prior to NMR analysis. 

For TLC analysis the reaction mixtures are applied to Silica Gel 60 F254 plates (Merck), dried 
30 and eluted in a solvent such as ethyl acetate : acetone : dichloromethane : methanol H 2 0 

(40:30:12:10:8, v/v). Plates are dried for one hour at room temperature and exposed to storage 
phosphor imaging plates prior to scanning on a Phosphorlmager. Based on the specific 
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radioactivity of the radiolabelled UIDP-glucose, the amount of glucoside formed is quantified. 
The radioactivity may also be determined by liquid scintillation counting ( LSC analysis). In 
some cases, where the glucoside formed is derived from a very hydrophobic aglycon, e.g. 
mandelonitrile, the glucoside can be extracted into an ethyl acetate phase and thereby be 
5 separated from unincorporated 14 C-UDP-glucose. 2 ml of scintillation cocktail are added to 250 
pi of each ethyl acetate extract and analyzed using a liquid scintillation counter. During column 
fractionation, those fractions containing gluceryltransferase activity can be identified using the 
aglycon substrate of interest and ethyl acetate extraction of the glucoside formed. 

10 A glycosyltransferase gene as described herein may be introduced into a cell in order to make a 
cell wherein the cell expresses a heterologous glycosyltransferase gene as described herein, in 
particular a microorganism cell or a filamentous fungi cell that expresses a heterologous 
glycosyltransferase gene as described herein. 

15 A gene involved in the biosynthesis pathway 

As said above, the term a "gene encoding a product involved in the biosynthesis pathway leading 
to a low molecular weight aglycon compound'* should be understood according to the art as a 
gene encoding a product involved in the biosynthesis pathway leading to a low molecular weight 
20 aglycon compound 

The art describes numerous suitable examples of such genes and the numbers are exponentially 
increasing since a number of whole genomes of different plant and microorganisms are 
continuously published 

25 

Reference may for example be made to the textbook "Biochemistry & Molecular Biology of 
Plants", edited by Bob B. Buchanan et al, ISBN 0-943088-37-2. Chapter 24: "Natural Product 
(Secondary Metabolites)" describes examples of a number of different suitable genes involved in 
the biosynthesis pathway leading to a low molecular weight aglycon compound 

30 

Just for illustration one example could be CYP71E1 from Sorghum bicolor that starting from p- 
hydroxyphenylacetonitrile is involved the biosynthesis pathway leading to a low molecular 
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weight aglycon compound p-hydroxymandelonitrile (see figure 1 and Tattersall, DB et al, 
Science (2001) 293:1826-8). 

Further examples are the genes involved in the biosynthesis pathway leading to a low molecular 
5 weight aglycon compound vanillin as described in working examples herein. Even further 
examples are given in Szczebara et al., Nature Biotechnology (2003), 21 : 143-149. This article 
describes a recombinant yeast cell capable of producing hydrocortisone from a simple carbon 
source. The yeast cell comprised eight recombinantly introduced genes involved in the 
biosynthesis pathway. 

10 

As said above the skilled person has at his disposal a number of different biosynthesis pathway 
genes. Further, he may by use of known routine methods easily identify further suitable 
biosynthesis pathway genes. For instance, the numerous biosynthesis pathway gene sequences 
may be used to make appropriate PCR primers to clone new suitable biosynthesis pathway 
1 5 genes. This could e.g. be done by a method comprising following steps: 

(a) preparing a cDNA library from a cell (e.g. plant tissue cell) expressing biosynthesis pathway 
genes of interest, 

(b) using relevant known biosynthesis pathway gene sequences to make appropriate PCR 
primers at to amplify part of the biosynthesis pathway gene encoding cDNA from the cDNA 

20 library, 

(c) using the DNA obtained in steps (b) as a probe to screen the DNA library prepared from cell 
expressing the biosynthesis pathway genes of interest, and 

(e) identifying and purifying vector DNA comprising an open reading frame encoding a 
biosynthesis pathway gene of interest. 

25 

As explained above a cell used in a method as described herein comprises a gene encoding a 
product involved in the biosynthesis pathway. The cell may comprise more than one gene 
encoding a product involved in the biosynthesis pathway. 

30 A suitable example is the transgenic Arabidopsis thaliana cell described in Tattersall, DB et al, 
Science (2001) 293:1826-8. It comprises the two biosynthesis pathway genes CYP79A1 and 
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CYP71E1 and can therefore make aglycon compound /j-hydroxymandelonitrile from tyrosine 
(see Figure 1 herein). 

A further example is the E. coli cell described in working examples herein mat comprises 
5 sufficient biosynthesis pathway genes to make the aglycon vanillin from glucose. 

A gene encoding a product involved in the biosynthesis pathway as described herein may be 
introduced into a cell in order to make a cell wherein the cell expresses a heterologous gene 
encoding a product involved in the biosynthesis pathway as described herein, in particular a 
1 0 microorganism cell or a filamentous fungi cell mat expresses a heterologous gene encoding a 
product involved in the biosynthesis pathway as described herein. 

Tow molecular weight organic asJvcon compound 

1 5 As said above, the low molecular weight organic aglycon compound, as described herein, has a 
molecular weight from 50 to 3000. Preferably, the low molecular weight organic aglycon 
compound has a molecular weight from 50 to 2000, more preferably a molecular weight from 50 
to 1 000, and even more preferably a molecular weight from 50 to 750. The molecular weight is 
the mass of one molecule in atomic mass units. 

20 

Further, the low molecular weight organic aglycon compound is preferably a compound selected 
from the group consisting of more or less saturated Alkyl-, Cycloalkyl-, Cycloalkylalkyl-, 
Arallyl- and Aryl, with 1 to 50 C-atoms and 0 to 20 heteroatoms and optionally substituted, in 
particular with Hydroxy-, Amino-, Sulfide-, Carboxy-, or Nitro groups, at the 1 to 56 C-atoms 
25 and/or 0 to 20 Heteroatoms; more preferably a compound selected from the group consisting of 
more or less saturated Alkyl-, Cycloalkyl-, Cycloalkylalkyl-, Arallyl- and Aryl, with 1 to 32 C- 
atoms and 0 to 10 heteroatoms and optionally substituted, in particular with Hydroxy-, Amino-, 
Sulfide-, Carboxy-, or Nitro groups, at the 1 to 32 C-atoms and/or 0 to 10 Heteroatoms. 

30 In a preferred embodiment, the low molecular weight organic aglycon compound is an alcohol, 
in particular an aromatic alcohol. 
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Plants may be seen as the organic chemists per excellence in nature. More than 200.000 different 
natural products are known from plants. These enable plants to deter herbivores and pests, 
attract pollinators, communicate with other plants and constantly adapt to climatic changes. As 
explained before the majority of these compounds may be termed secondary metabolites. 

5 

The term "secondary metabolite" relates to that plants and microbes synthesize a large number of 
natural substances, in particular secondary metabolites, with diverse and generally unclear 
function. In contrast to the primary metabolites (eg amino acids, sugars, fatty acids), which are 
involved in iundamental functions like metabolism, growth, maintenance and survival, secondary 
10 metabolites are not required for fundamental functions. The term secondary metabolite should 
herein be understood in view of such, according to the art, standard description of the term. 

In a preferred embodiment the low molecular weight organic aglycon compound is a secondary 
metabolite compound, preferably a plant secondary metabolite compound. 

15 

Examples of preferred secondary metabolite compound classes are: 

• Terpenoids 

• Alkaloids 

• Phenylpropanoids 
20 • Phenyl derivatives 

• Hexanol derivatives 

• Flavonoids 

• Coumarins, stilbenes 

• Cyanohydrins 
25 • Glucosinolates 

• Steroids 

• Hormones 

• Antibiotics 

• and Herbicides. 

30 

Of the list above, the more preferred secondary metabolite compound classes are: 

• Terpenoids 
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• Alkaloids 

• Phenylpropanoids 

• Phenyl derivatives 

• Hexanol derivatives 
5 • Flavonoids 

• Coumarins, stilbenes 

• Cyanohydrins 

• and Glucosinolates. 



10 Examples of preferred individual low molecular weight organic aglycon compounds is a 
compound selected from the list consisting ofi 

mandelonitrile, p-hydroxymandelonitrile, acetone cyanohydrin, hydroquinone, benzyl alcohol, p- 
hydroxybenzyl alcohol, benzoic acid, p-hydroxybenzoic acid, p-hydroxybenzaldehyde, gentisic 
acid, caffeic acid, 2-hydroxy cinnamic acid, resveratrol (stilbene), salicyhc acid, p- 

1 5 hydroxymandelic acid, vanillic acid, vanillin, 2-hydroxy methoxybenzylalcohol, quercetin, 

cyanidin (anthocyanidin), biochanin A (isoflavone), naringenin (flavanone), apigenin (flavone), 1- 
hexanol, trans hexen-l-ol, cis hexen-l-ol, 3-methyl hexen-l-ol, 3-methyl hexen-l-ol, indole acetic 
acid (plant hormone), geraniol (monoterpenoid), tomatidine (alkaloid), nerol, p-citronellol, 
saligerin, resorcinol, thymol, phenol, methylvaniHin, p-hydroxyacetophenone, p-methoxyphenol, 

20 3,4-dimethoxyphenol, coniferyl alcohol, o-coumaric acid, p-coumaric adic, 

caffeic acid, luteolin, quercitin-, catechinl-, cyadinin, p-hydroxybenzoic acid, abscisic acid 
(phytohormone), 2,4,5-trichlorophenol (TCP), pentachlorophenol, 4-nitrophenol, 3,5-dibromo- 
4-hydrobenzoic acid, tetracycline, and 2,2-bis-(4-chlorophenyl)-acetic acid. 



25 Growin g a cell 

In a method as described herein, the cell should be grown under conditions where it produces the 
aglycon and the corresponding glycosylated form of the aglycon. An important point in relation 
to the growth of the cell is that adequate intermediate compounds must be available to the cell. 
30 This means for example that if the cell is e.g. an E. coli cell comprising adequate genes involved 
in the biosynthesis pathway leading e.g. to the aglycon compound vanillin from e.g. glucose, then 
the E. coli cell should be fermented under conditions where glucose is available to the cell. 
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If the cell is e.g. a plant cell comprising adequate genes involved in the biosynthesis pathway 
leading to the aglycon compound from e.g. tyrosine then the plant cell should be grown under 
conditions where tyrosine is available to the growing plant 

5 

The skilled person knows how to grow a specific cell of interest to ensure this. 

A method of producing a low molecular weight organic aglvcon compound 

10 As described above the third aspect of the invention relates to a method of producing a low 
molecular weight organic aglycon compound comprising following steps: 

a) growing a cell, which comprises a gene encoding a product involved in the 
biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 
glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 

1 5 produced aglycon, under suitable conditions wherein the cell produces the aglycon and 

the associated glycosylated form of the aglycon; 

b) deglycosylating the glycosylated form of the aglycon; and 

c) recovering the aglycon compound; 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
20 weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
a sugar to the aglycon compound 

All embodiments above are also embodiments of this third aspect. An essential step of this 
25 method is the deglycosylating step b). This step is further described below, where some non- 
limiting examples are included to illustrate a number of technical advantages in relation to this 
deglycosylating step. 

Though glucosides are often desirable natural products and attempts to use transgenic glucosyl 
30 transferases to produce them have been taken (e.g. Arend et al, 2001, Biotechnology and 

Bioengineering 76: 126-131) the corresponding aglucone can be more commercially interesting. 
One example is vanillin, a natural flavour of significant commercial interest Vanillin is a phenolic 
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compound that accumulates in the fruits of the orchid Vanilla sp. in a glucosylated form, gluco- 
vanillin. In order to obtain the desired natural flavour, gluco-vanillin must be deglucosylated. 
This is achieved by fermentation (curing) of the fruits, so-called vanilla beans, whereby 
endogenous beta-glucosidases are activated 

5 

In the step b) of the method of the third aspect of the invention, the glycosylated intermediate of 
the desired natural product is subjected to a deglycosylating step. This may be done by chemical 
hydrolysis according to known methods in the art or enzymatically by e.g. use an enzyme with 
beta-glycosidase activity. Numerous suitable beta-glycosidases are known to the skilled person. 
1 0 This can e.g. be achieved first by recovering glycosylated form of the aglycon for instance by 
extracting the glucosylated intermediate in a suitable solvent, e.g. methanol, or by collecting it 
after excretion from the producing organism or plant. Secondly, the glucosylated intermediate is 
purified and exposed to a beta-glucosidase in vitro or to an adequate chemical hydrolysis. 

1 5 Alternatively, the stabilized glucosylated intermediate can be let alone in the plant and be allowed 
to accumulate in cellular compartments or tissues where it is protected from endogenous beta- 
glucosidase activity. In such a protected spatial environment the glucosylated intermediate is no 
longer exposed to enzymatic activity that could further metabolize its deglucosylated equivalent 
It could, however, also be protected in time, if the desired natural product was kept in its 

20 glucosylated form until developmental changes in the plant had led to a significant decrease in 
the activity of the enzymes that could otherwise metabolize its deglucosylated form. 

It is preferred for the present invention that glycosylation of the desired aglycon is uncoupled 
from its subsequent deglycosylation, since the aglycon would otherwise be subject to forther 
25 metabolic processing. As indicated above, this uncoupling can be either in time or space. 
Preferably it is in space. 

In the final step c) of the method of the third aspect of invention, the desired aglycon is 
recovered. 

30 

The present method will address at least three problems related to biotechnological production 
of organic molecules. First, it will allow for increased production of organic molecules that are 
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stable and constitute the final product of the respective biosynthetic pathway of the production 
organism. Second, it will allow for increased yield of organic molecules that are not the end 
product of the respective biosynthetic pathway and are consequently further metabolized by the 
production organism. Third, it will allow for increased production of organic molecules that are 
5 toxic to the production organism. 

Regarding toxicity is show in examples 1 and 2 that the presence of a heterologous UDPG- 
glucosyltransferase in a microorganism can increase its tolerance to vanillin through 
glucosylation. This principle is evidently not limited to vanillin but can be applied to any toxic 
1 0 substance that can be converted into a less toxic glucoside by the heterologous UDPG- 
glucosyltransferase, including e.g. taxol. 

This approach can be helpful in increasing tolerance to a desired organic molecule, which is the 
end product of its biosynthetic pathway and is therefore accumulated in the production organism 
1 5 to concentrations that might be limiting production. However, the approach might also address 
detoxification of substances that are structurally or biosynthetically unrelated to the desired 
molecule. Such substances could be byproducts of the desired biosynthetic pathway or 
contaminants in the growth medium. 

20 In the industrial fermentation of ethanol from lignocellulose, vanillin has been reported to be one 
of the strongest inhibitors among the hydrolysis byproducts (Delgenes et al,1996, Enzyme 
Microb Technol, 19: 220) and it acts as a strong growth inhibitor at concentrations of 5 g/1 in me 
medium (Pfeifer et al, 1984, Biotechnol Lett 6: 541). Accordingly, fermentation of lignocellulose 
using microorganisms expressing a heterologous UDPG-ghicosyltransferase might not only 

25 increase the yield of ethanol but could potentially also lead to a means for parallel production of 
commercially valuable vanillin after recovery and deglucosylation of non-toxic vanillin glucoside. 

The method as described herein are of direct importance to biotechnological production of 
several organic molecules, since it can increase yield of organic molecules that are stable and 
30 constitute the final product of the respective biosynthetic pathway of me production organism. 
The feasibility of tins approach is shown in examples 3 and 4 using production of vanillin in a 
microbial system as an example. However, it is obvious for those skilled in the art that this 
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principle is not limited to production of vanillin from ferulic acid, but can be applied in virtually 
any biosynthetic pathway where the desired organic molecule is the end product and opposes at 
least some inhibition to production. 

5 When the desired organic molecule is the end product of its biosynthetic pathway it will itself in 
many cases be a limiting factor in the rate of production. This is due to product inhibition of 
enzymes in the biosynthetic pathway and thus leads to reduced production rate. Instead of 
achieving high levels of the desired end product, an intermediate product might build up and 
either accumulate or be directed into other metabolic pathways and consequently become lost 

1 0 for the desired biosynthetic pathway. 

The method will overcome this product inhibition by adding an additional step to the 
biosynthetic pathway, so the inhibiting product is removed and thereby is no longer able to 
inhibit the enzyme. When biosynthesis is no longer suppressed by product inhibition the turnover 
15 of the initial substrates and their conversion into first the desired product and subsequently into 
its glucosylated derivative will substantially increase. According to the method, the glucosylated 
derivative is then extracted and converted back into the desired aglucone by e.g. beta- 
glucosidase activity. 

20 The desired organic molecule might, however, not always be the end product of its biosynthetic 
pathway in the production organism. In such cases, it is further metabolized and might eventually 
be lost for commercial recovery. In examples 5 and 6 is demonstrated how the method can be 
applied to rescue such intermediates. In the examples, an aldehyde oxime is rescued by 
glucosylation of a heterologous UDPG-glucosyltransferase, but it is evident for those skilled in 

25 the art that the principle is general and can be used for any intermediate product of desire that 
can be glucosylated. The examples address a molecule of desire, the oxime aldehyde, which is 
not naturally produced in the production organism, but in other examples the organic molecule 
of desire might as well be a substance that is produced naturally. 

30 The sorghum cytochrome P450 enzyme complex CYP79 catalyses the conversion of the amino 
acid tyrosine into an aldehyde oxime, p-hydroxyphenylacetaldoxime (Bak et al, 2000, Plant 
Physiol 123: 1437). Attempts to transfer the sorghum CYP79 gene to other plants or 
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microorganisms leads to very low production of the oxime, since it is toxic and is being further 
metabolised to at least two presently unidentified substances, X and Y, that might be nitrile and 
alcohol derivates (Halkier et al, 1995, Arch Biochem Biophys 322: 369; Bak et al, 2000, Plant 
Physiol 123: 1437). In plants, e.g. tobacco and Arabidopsis, the oxime is also glucosylated into 
5 p-hydroxyphenyl-(acetaldoxime glucoside) by an oxime specific UDPG-glucosyl transferase. 

It is a general principle of the method that a glucosylated form of Ihe desired organic molecule 
becomes the final product produced in the production organism. In addition to the advantages 
this principle can have in yield, it might also posses some benefits during purification or 
10 extraction of the desired organic molecule. This is demonstrated in example 7, where it is 
exploited that the solubility of the desired molecule changes after glucosylation. 

This principle can be useful if the desired molecule is difficult to separate from other substances 
present in the production organism or the growth medium. The glucosylated form could e.g. be 
1 5 exported to another compartment than the contaminating molecule, thereby fecilitating 
purification. Alternatively, it could be excreted to the growth medium, whereas the 
contaminating molecule is left alone within the organism. Moreover, the purification process 
could utilize that the glucosylated form of the desired molecule has novel chemical properties, - 
including not only solubility but also chromatographic properties etc. 

20 

Method to selecting transgeni c organisms with increased biosynthesis flow 

In example 6, is shown an example of how the method of the third aspect of the invention can be 
utilized to establish commercial production of p-hydroxyphenylacetaldoxime in microorganisms. 
25 This is achieved by transferring both the sorghum CYP79 gene and the oxime specific UDPG- 
glucosyltransferase (see Figure 1) into the microorganism. This results in the production of p- 
hydroxyphenyl-(acetaldoxime glucoside), which is stable and non-toxic. Moreover, this 
glucoside can be extracted and converted into the desired p-hydroxyphenylacetaldoxime by e.g. 
in vitro beta-glucosidase activity. 

30 

Using this approach, the method not only makes the production of the desired molecule, the 
oxime, possible. It also allows for the selection of transgenic microorganisms with high 
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expression levels of the sorghum CYP79 in a very active form, since the toxic product of the 
enzyme is readily detoxified by glucosylation. If the oxime specific UDPG-glucosyl transferase 
was not present, natural selection would favor microorganisms harboring the CYP79 gene in a 
low expressing state and in a mutated form with less activity. 

5 

The example discussed above, iDustrates how the method can be utilized to obtain, through 
selection, more efficient production organisms. This principle is not restricted to the example 
above, but will obviously apply for many other biosynthetic pathways and production organisms. 

1 0 Accordingly, as said above a fourth aspect of the invention relates to a method for selecting a 
cell with increased production of a glycosylated form of a low molecular weight organic aglycon 
compound comprising following steps: 

a) growing a cell, which comprises a gene encoding a product involved in the 
biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 

1 5 glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 

produced aglycon, under suitable conditions wherein the cell produces the aglycon and 
the corresponding glycosylated form of the aglycon; 

b) treating the cell in a way that changes the expression level of at least one gene 
involved in the biosynthesis pathway leading to a low molecular weight organic aglycon 

20 and/or the glycosyltransferase gene capable of glycosylating the produced aglycon in 

order to make a library of cells with different expression levels of the genes; and 

c) selecting a cell that produces a higher amount of the glycosylated form of the aglycon 
as compared to the cell of step a); 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
25 weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
a sugar to the aglycon compound. 

The term "library of cells" may herein also be termed "population of cells". The library of cells 
30 may comprise as little as two different cells. 
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The term "different expression levels of the genes" may be a library comprising individual cells 
with both increased or decreased expression levels of the genes. Preferably, the library mainly 
comprises individual cells with increased expression levels of the genes. 

5 Wherein the cell is within a plant the library will comprise a library of different plants. Preferably 
the library comprises 10 2 plants, more preferably 10 3 plants, even more preferably 10 s plants, 
even more preferably 10 6 plants. 

When the cell is a microorganism the library may normally easily be made relatively bigger. 
10 Consequently, when the cell is a microorganism the library comprises 10 5 cells, more preferably 
10 7 cells, even more preferably 10* cells, even more preferably 10 9 cells. 

The selected cell of this method may be a preferred cell to use in a method of producing a low 
molecular weight organic compound of the first aspect of the invention or a method of 
1 5 producing a low molecular weight organic aglycon compound of the third aspect of the 

inventions or in relation to the different embodiments of these methods to make a low molecular 
weight organic compound. 

Accordingly, an embodiment of the invention relates to a method where there is first selected a 
20 cell as described in the fourth aspect above and thereafter this cell is used in a method of 
producing a low molecular weight organic compound of the first aspect of the invention or a 
method of producing a low molecular weight organic aglycon compound of the third aspect of 
the inventions or in relation to the different embodiments of these methods to make a low 
molecular weight organic compound. 

25 

The treating of the cell step b) above may be done by numerous different strategies known to the 
skilled person. Examples are mutagenesis such as UV treatment, adequate chemical treatment to 
make DNA mutations, random mutagenesis of e.g. specific promoters of the relevant genes, 
sh uffling of cells to make a library of shuffled cells and etc. 

30 

The selected cell of step c) should preferably produce 1 .25 times higher amounts of the 
glycosylated form of the aglycon as compared to the cell of step a), more preferably the selected 
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cell of step c) should preferably produce 1 .5 times higher amounts of the glycosylated form of 
the aglycon as compared to the cell of step a), even more preferably the selected cell of step c) 
should preferably produce 2 times higher amounts of the glycosylated form of the aglycon as 
compared to the cell of step a), and most preferably the selected cell of step c) should preferably 
5 produce 3 times higher amounts of the glycosylated form of the aglycon as compared to the cell 
of step a). 

The selecting may be done in a number of ways according to the art. For instance in a micro titer 
assay where each well comprises a sample of a specific cell and there is measured amounts of the 
10 glycosylated form of the aglycon. There are numerous other ways of doing this according to the 
general knowledge of the skilled person. 

Example 8 describes how this method is used to make an Arabidopsis thaliana plant enable of 
producing increased mg of dhurrin per gram of fresh weight The starting cell of step a) is the 

15 Arabidopsis thaliana transgenic cell described in Tattersall, DB et al, Science (2001) 293:1826- 
8. As explained above the Arabidopsis thaliana transgenic cell comprises the entire pathway for 
synthesis of the cyanogenic glucoside dhurrin. It was demonstrated that the transgenic 
Arabidopsis thaliana plant was capable of producing 4 mg of dhurrin per gram of fresh weight. 
After performing the method as described in example 8 an Arabidopsis thaliana transgenic cell is 

20 selected in step c) that produces more than 6 mg of dhurrin per gram of fresh weight. 

Without being limited to theory, it is believed to be the first time that a plant has been provided 
that is capable of producing more than 4 mg per gram of fresh weight of a glycosylated form of a 
low molecular weight organic aglycon compound. 

25 

Accordingly, as said above a fifth aspect of the invention relates to a plant capable of producing 
more than 5 mg per gram of fresh weight of a glycosylated form of a low molecular weight 
organic aglycon compound, 

(i) wherein the low molecular weight organic aglycon compound has a molecular weight 
30 from 50 to 3000. 
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Preferably, the plant is capable of producing more than 6 mg per gram of fresh weight of a 
glycosylated form of a low molecular weight organic aglycon compound, more preferably the 
plant is capable of producing more than 8 mg per gram of fresh weight of a glycosylated form of 
a low molecular weight organic aglycon compound. Preferably, the plant produces from 5 mg to 
5 40 mg per gram of fresh weight of a glycosylated form of a low molecular weight organic 
aglycon compound 

Preferably, the plant is a transgenic plant. 
10 EXAMPLES: 

General molecular biology methods 

Unless otherwise mentioned the DNA manipulations and transformations were performed using 
1 5 standard methods of molecular biology (Sambrook et aL (1 989) Molecular cloning: A laboratory 
manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current 
protocols in Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R., and Cutting, S. 
M. (eds.) "Molecular Biological Methods for Bacillus". John Wiley and Sons, 1990). 

20 Enzymes for DNA manipulations were used according to the specifications of the suppliers. 

EXAMPLE 1. ANALYSIS OF INCREASED TOLERANCE TO VANILLIN IN 
MICROORGANISMS EXPRESSING A UDPG-GLUCOSYL TRANSFERASE 

25 Vanillin sensitivity is determined in a number of microorganisms. The microorganisms includes 
the Escherichia coli strains DH5-alpha, TOP10, JM109 and K01418; the Pseudomonas 
fluorescens strains DSM 50091, DSM 50108 and DSM 50124; the Bacillus subtilis strain DSM 
704; and the Corynebacterium glutamicum strain DSM 20300. The microorganisms also include 
strains of Saccharomyces cerevisiae, S. uvarum, S. bayanus, S. paradoxus, S. kudriavzevii, S. 

30 mikatae, S. cariocanus, S. servazzdi, S. castellii, S. kluyverii, Kluyveromyces lactis, 
Zygosaccharomyces fermentatii, Torulaspora delbruekii, Debaromyces orientalis and 
Schizosaccharomyces pombe. 
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The minimum inhibitory concentration of vanillin is determined by the two-fold serial dilution 
assay (Hufford et al, 1975, J Phann Sci 4: 789). Furthermore, the concentration of vanillin 
necessary to inhibit growth by more than 50% is determined. Once the sensitivity of vanillin has 
5 been determined in the various bacterial and yeast strains a few of these are selected in order to 
obtain microbial strains displaying a broad range of vanillin sensitivity. 

The selected strains are transformed with a UDPG-glucosyl transferase gene, e.g. Sorghum 
bicolor UDPG-glucosyl transferase UGT85B1 (Jones et al, 1999, J Biol Chem 274: 35483), 
10 Arabidopsis thaliana UDPG-glucosyl transferase UGT89B1 (Limetal, 2002, J Biol Chem 277: 
586) and Rauvolfia serpentina arbutin synthase (Arend et al, 2001, Biotechnol Bioeng 76: 126). 

For expression of UDPG-glucosyl transferase in E. coli , the IPTG-inducible expression vectors 
pSP19gl0L, pKK223-3 andpETlOlD/Topo are employed. In addition, the newly constructed 

1 5 E. coli expression vectors using constitutive E. coli glycolytic gene promoters are used. For 
expression in P. fluorescens, the BHR expression vector pYanni3 are used. For expression of 
UDPG-glucosyl transferase in yeast, the genes are PCR amplified and inserted into plasmid 
pJH259, in which the genes are expressed from the constitutive and strong glycolytic TPI1 
promoter. Alternatively, the TPI1 promoter is exchanged with a PCR-amplified MET25 

20 promoter, which is repressible by methionine. 

The UDPG-glucosyl transferase expressing microorganisms are subjected to analysis of their 
sensitivity to vanillin in the growth medium as described above. An increased tolerance to 
vanillin, determined as an increase in either the minimum inhibitory concentration or the 
25 concentration necessary to inhibit growth by more than 50%, means that transgenic UDPG- 
glucosyl transferase activity has converted some or most of the applied vanillin into vanillin 
glucoside, which is considerably less toxic. 

EXAMPLE 2 IDENTIFICATION OF GLUCOSIDES IN MICROORGANISMS EXPRESSING 
30 A UDPG-GLUCOSYL TRANSFERASE 
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Microorganisms expressing a heterologous UDPG-glucosyl transferase will be grown in medium 
containing appropriate levels of vanillin or ethylvanillin. The microorganisms will be harvested 
and their content of glucosides will be extracted in methanol or other suitable solvents. The 
presence of vanillin glucoside or ethylvanillin glucoside will be established and their levels will be 
5 quantified The content of vanillin glucoside and ethylvanillin glucoside will be compared in 
transgenic and non-transgenic microorganisms as well as in microorganisms grown in medium 
with and without vanillin or ethylvanillin. 

The presence of increased amounts of vanillin glucoside or ethylvanillin glucoside in UDPG- 
1 0 glucosyl transferase expressing microorganisms grown in medium containing vanillin or 
ethylvanillin will indicate that the aglucones are taken up and glucosylated by the transgenic 
organism. 

15 EXAMPLE 3. INCREASED UPTAKE AND TURNOVER OF FERULIC ACID IN 
MICROORGANISMS EXPRESSING A UDPG-GLUCOSYL TRANSFERASE 

Microorganisms capable of converting ferulic acid into vanillin are genetically modified to 
express a heterologous UDPG-glucosyl transferase. Streptomyces setonii strain ATCC 391 16 is 

20 used for these studies since it has previously been employed for industrial production of vanillin 
from ferulic acid (Muheim & Lerch, 1999, Appl Microbiol Biotechnol 51: 456; Muheim et al, 
1998, EP 0885968A1). Alternatively, Pseudomonas putida strain AN103 (Narbad & Gasson, 
1998, Microbiol 144: 1397; Narbad et al, 1997, WO 97/35999), or Amycolatopsis sp. HR167 
(Rabenhorst & Hopp, 1997, EP 0761817A2) are used. The microorganisms are transformed 

25 with the appropriate construct so as to express Sorghum bicolor UDPG-glucosyl transferase 
UGT85B1, Arabidopsis thaliana UDPG-ghicosyl transferase UGT89B1 or Rauvolfia serpentina 
arbutin synthase. 

The genetically modified microorganisms are grown in medium containing appropriate 
30 concentrations of ferulic acid. The uptake of ferulic acid is determined by following its 
concentration in the medium and its metabolic turnover are determined by analysis of the 
concentration of vanillin and vanillin glucoside in the microorganisms and the growth medium. 
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These values are compared to similar analysis from experiments carried out with non-transgenic 
microorganisms. 

An increased uptake of feruollic acid by microorganisms expressing heterologous UDPG- 
5 glucosyl transferase and their accumulation of vanillin glucoside demonstrate that the glucosyl 
transferase has increased the conversion of ferulic acid into vanillin, which is then subsequently 
converted into vanillin glucoside. 

10 EXAMPLE 4. INCREASED YIELD OF VANILLIN THROUGH DEGLUCOSYLATION OF 
VANILLIN GLUCOSIDE PRODUCED IN MICROORGANISMS EXPRESSING A UDPG- 
GLUCOSYL TRANSFERASE 

Microorganisms capable of converting ferulic acid into vanillin are transformed with a gene 
15 encoding a UDPG-glucosyl transferase and are grown in the presence of ferulic acid The 
concentration of ferulic acid in the growth medium is monitored and kept at constant level in 
order to compensate for the amount taken up by the microorganisms and used for the synthesis 
of vanillin. After a period of several days, vanillin is extracted from the microorganisms and 
quantified In addition, vanillin glucoside will be extracted and converted into vanillin through 
20 deglucosylation by beta-glucosidase activity in vitro. The amount of recovered vanillin will be 
quantified 

Similar experiments and quantification of vanillin synthesised as an end product as well as 
vanillin recovered from in vitro deglucosylation of vanillin glucoside are performed in non- 
25 transformed microorganisms. 

A greater amount of total vanillin extracted and recovered from UDPG-glucosyl transferase 
expressing microorganisms as compared to non-transformed microorganisms demonstrates that 
the presence of a heterologous UDPG-glucosyl transferase can lead to more ferulic acid being 
30 channeled into the synthesis of vanillin and its glucosylated form, from which vanillin can later be 
recovered Consequently, total yield of vanillin can be increased through expression of a UDPG- 
glucosyl transferase. 
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EXAMPLE 5. CLONING OF AN OXIME-SPECIFIC PLANT UDPG-GLUCOSYL 
TRANSFERASE 

5 The sorghum CYP79 gene is inserted into an E. coli expression vector under the control of the 
IPTG inducible lac Z promoter. The vector is used for transformation of E. coli, transformants 
are selected and IPTG induction of CYP79 is demonstrated. For expression cloning of an oxime- 
specific plant UDPG-glucosyl transferase, an Arabidopsis cDNA library is cloned into a suitable 
vector, which is then used for transformation of the strain harboring the CYP79 gene. 

10 

Transformants are selected for oxime-specific UDPG-glucosyl transferase activity by IPTG 
induction of the CYP79 gene. This will lead to production of toxic 
p-hydroxyphenylacetaldoxime. Transformants harboring the desired oxime-specific UDPG- 
glucosyl transferase activity are able to detoxify this compound by glucosylation into p- 
15 hydroxyphenyl-(acetaldoxime glucoside). These transformants are rescued and the gene is cloned 
by conventional techniques. 

EXAMPLE 6. HIGH LEVEL PRODUCTION OF A TOXIC AND UNSTABLE OXME 
ALDEHYDE 

20 

A gene encoding the Arabidopsis oxime specific UDPG-glucosyl transferase is inserted into and 
E. coli expression vector under the control of a strong constitutive promoter. The vector is used 
for transformation of E. coli and transformants are selected. Transformants are subsequently 
screened for oxime specific UDPG-glucosyl transferase expression, and those with the highest 
25 level of expression are selected. 

The selected transformants are subsequently used for transformation of another expression 
vector containing the sorghum CYP79 gene under control of a strong constitutive promoter. 
Transformants are selected and subsequently screened for high level expression of the CYP79 
30 gene product. A further screening can select those with optimal CYP79 enzymatic activity based 
on their ability to acumulate p-hydroxyphenyl-(acetaldoxime glucoside). 



Ourref.l068EP01 



38 

EXAMPLE 7. FACILITATING PURIFICATION OF AN ORGANIC MOLECULE BY 
GLUCOSYLATION 

5 The cell used in this example is the E. coli cell of Example 1 3 capable of producing vanillin 
glucoside. 

In continuous fermentation the fermentor is inoculated with media for growth and the 
fermentation is started under aeration conditions. 

10 

The first 12 - 18 hours is run for cell growth and after 12-18 hours the fermentor is fed with 
glucose for production of vanillin glucoside. In the fermentation the growth rate is linear and it is 
anticipated that the limiting fector is the solubility of vanillin glucoside. From other 
corresponding glucosides it is known that the solubility is app. 100 g/1 at STP and increases at 
15 higher temperatures. 

After approximately 30 hours the high concentration is reach and harvesting is started. The cells 
are separated and recycled to the fermentor. 

20 The supernatant is either concentrated to the limit of vanillin glucoside solubility or directly run 
over a column with beta-glucosidase. The supernatant is recirculated to the fermentor. If other 
products should build up it can be passed over a cleaning column before recirculation. It is 
anticipated to have a row of columns where one is being used in process and the others are 
cleaned and regenerated. 

25 

The product is recovered as crystals from the enzymatic treatment In order to decrease the 
amount of recirculated vanillin the temperature is lowered in connection with enzyme treatment. 



30 



EXAMPLE 8. SELECTING OF A PLANT CELL WITH INCREASED DHURRIN 
PRODUCTION 
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The starting cell of is the Arabidopsis thaliana transgenic cell described in Tattersall, DB et al, 
Science (2001) 293:1826-8, The Arabidopsis thaliana transgenic cell comprises the entire 
pathway for synthesis of the cyanogenic glucoside dhurrin. It was demonstrated that the 
transgenic Arabidopsis thaliana plant was capable of producing 4 mg of dhurrin per gram of 
5 fresh weight. 

The cell capable of producing 4 mg of dhurrin is treated with y-irradiation or a by introducing a 
gene having higher CYP71 expression in order to make a library of plant cells. 

1 0 The library of cells are investigated and a Arabidopsis thaliana transgenic cell is selected that 
produces more than 6 mg of dhurrin per gram of fresh weight. 

EXAMPLE 9: Vanillin sensitivity of various strains ofE. coli 

15 The three different E. coli strains TOP10, JM109, and K01418 were grown for 18 hours in the 
appropriate media, after which serial dilutions in growth medium (dilution factors 1, 1/100, 
1/10000 and 1/1000000) were made. Droplets of these cell suspensions were applied to Petri 
dishes containing solid LB growth medium with various concentrations of vanillin, and the plates 
were incubated at the appropriate growth temperature for 24 hours (E. coli), after which they 

20 were photographed. The percentual (w/v) concentration of vanillin at which growth of a given 
microorganism at the 1/10000 dilution was not detectable, and at which growth at the 1/100 
dilution was severely inhibited, was recorded. This number constituted the STV (Sensitive To 
Vanillin) level. The STV numbers for the different microorganisms studied can be seen in Table 
1. 



25 



Strain 


Genotype 


STV 


TOP10 


F- mcrA Delta (mrr-hsdRMS-mcrfiC) Phi801acZ Delta-M15 Delta-lacX74 






recAl deoR araD139 Delta (ara-leu)7697 galU galK rpsL (StrR) endAl 


0.12 




nupG 




JM109 


FtraD36 laclq Delta (lacZ) M15 proA+B+/ el4- (McrA-) Delta (lac- 
proAB) thi gyrA96 (NaIR) endAl hsdR17 (rK- mK-) relAl supE44 recAl 


0.14 
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K01418 



F- Delta (codB-lacI) 3 relAl? bglA677::TnlO spoTl bglB676::Lambda-lacZ 
bglGo-67thi-l 



0.18 



Table 1. Vanillin sensitivity for various E. coli strains on solid growth medium 

EXAMPLE 10: PCR amplification ofS. bicolor UGT85B1, A. thaliana UGT89B1 andR. 
5 serpentina arbutin synthase (AS) genes 

Roche Pwo polymerase and a DNA Engine Thermocycler were used for all PCR amplifications. 
The oligonucleotides referred to in the text can be seen in Table 2. 



10 



vrilgu tlO.fr 




sequence [p 


1 


SB_GT_KK_F 


ATTAGAATTCATGGGCAGCAACGCGCCGCCGCCG 


2 


SB_GT_KK_R 


ATTAAAGCTTTTACTGCTTGCCCCCGACCAGCAG 


3 


SB_GT_pET_F 


CACCATGGGCAGCAACGCGCCGCCGCCG 


4 


SB_GT_pET_R 


TTACTGCTTGCCCCCGACCAGCAG 


5 


SB_GT_S_F1 


ATGGGCAGCAACGCGCCGCCT 


6 


SB_GT_S_F2 


CGCAGCTGCCAGGGAGGCCGG 


7 


SB_GT_S_F3 


GCCTCCGCCGGCCTCGCCGCC 


8 


SB_GT_S_F4 


CAGACCACCAACTGCAGGCAG 


9 


SB_GT_S_R1 


GAGAGGGAGAGGCCGTCGTCG 


10 


AT_GT_KK-F 


ATTAGAATTCATGAAAGTGAACGAAGAAAACAAC 


11 


AT_GT_KK-R 


ATTAAAGCTTTTATTTGTTTAGTCCTAAACTAACGA 
C 


12 


AT_GT_pET_F 


ATGAAAGTGAACGAAGAAAACAAC 


13 


AT_GT_pET_R 


TTATTTGTTTAGTCCTAAACTAACGAC 


14 


AT_GT_S_F1 


ATGAAAGTGAACGAGGAAAAC 


15 


AT_GT_S_F2 


GAATCCCrCGTTTCGATTTCT 


16 


AT_GT_S_F3 


CTTGACGCACGTGAGGATAAC 



17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 

30 
31 

32 
33 

34 
35 

36 

37 
38 
39 



41 



AT_GT_S_F4 CCTGACACGGTGCCTGACCCG ~ 
AT_GT_S_R1 CGGAGGGGATTGAAGGGTGGG 
AS_GT_EC_KK_F ATTAGAATTCATGGAACATACCCCGCACATT 
AS GT EC KK R ATTAGAATTCTTATGTACTGGAAATTTTGTTC 
AS_GT_EC_pET_F CACCATGGAACATACCCCGCACATT 
AS_GT_EC_pET_R TTATGTACTGGAAATTTTGTTC 
AS_GT_S_F1 ATGGAGCATACACCTCACAT 
AS_GT_S_F2 GACGGCCATGTGCCTGTCTC 
ASGTSF3 GGGGCAGTCTCCCATAATCA 
AS_GT_S_F4 AGGGTCTTAAAGTGGCCCTG 
AS_GT_S_R1 TACGGGTCTCTATCCTAACA 
Pfba_F ATTAGAATTCAAAAATCACAGGGCAGGGAAAC 
Pfba_R ATTAGGCGCGCCTCTAGAGTCTCTTGTCCTGTATC 
GTCGGG 

PpfkA_F ATTAGAATTCTCAGTATAAAAGAGAGCCAGAC 
PpfkA_R ATTAGGCGCGCCTCTAGAGACTACCTCTGAACTTT 
GGAAT 

PgapA_F ATTAGAATTCTTGCTCACATCTCACTTTAATC 
PgapA_R ATTAGGCGCGCCTCTAGAATATTCCACCAGCTATT 
TGTTA 

PtpiA_F ATTAGAATTCCAAAAAGCAAAGCCTTTGTGCC 
PtpiA_R ATTAGGCGCGCCTCTAGATTTAATTCTCCACGCTTA 
TAAG 

rcysBJF ATTAGGCGCGCCGGATCCTrTCTrGCGTTATTTTCG 
GCACC 

rcysB_R ATTAAAGCTTGAAAAACCGCCAGCCAGGCTTT 
3B_GT_EC_NP_F ATTATCTAGAATGGGCAGCAACGCGCCGCCGCCG 
?B_GT_EC_NP_R ATTAGGATCCTTACTGCTTGCCCCCGACCAGCAG 
\T_GT_EC_NP_F ATTATCTAGAATGAAAGTGAACGAAGAAAACAAC 
^T_GT_EC_NP_R ATTAGGATCCTTACCACCGTTCTATCTCCATCTTC 
IS_GT_EC_NP_F ATTATCTAGAATGGAACATACCCCGCACATT 
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43 



RS_GT_EC_NP_R ATTAGGATCCTTATGTACTGGAAATTTTGTTC 



Table 2. Oligonucleotides used in this study 



UGT85B1, UGT89B1 and AS for IPTG-controlled expression in E. coli 

5 

For amplification of UGT85B1 for use in the E. coli vector pKK223-3 (Amersham-Phannacia 
Biotech), the oligonucletides JH#1 and JH#2 (Table 2) were used, with the plasmid pSP19glOL- 
UGT85B1 (Jones et al., 1999, J Biol Chem 274: 35483) as template for the reaction. The 
reaction conditions employed were 94 "C, 2 min., 1 cycle, 94 °C, 30 sec, gradient 50-60 °C, 1 

10 min., 72 °C, 2 min., 30 cycles, followed by 72 °C, 7 min., 1 cycle. The reaction contained 5 % 
DMSO. Reaction samples conlaining a fragment of the expected size of 1.5 kb were combined, 
and used for ligation into the pCR-Blunt II-Topo vector. Clones containing EcdSl inserts of 1 .5 
kb were sequenced with the primers JH#5-9. A clone with a correct DNA sequence was 
identified and named pJH400. The UGT85B1 ORF was liberated from pJH400 with^coRI- 

15 Hindll (these restriction sites were included at the S'-teimini of the primers) and inserted in an 
EcdRI-Hindm digested pKK223-3 plasmid A clone with a correct UGT85B1 insert was 
identified and named p JH40 1 . 

For amplification of UGT85B1 for use in the E. coli vector pETlOl -D/Topo, an identical PCR 
20 reaction was performed, only using the primers JH#3 and 4. The same PCR program was 
employed, and a fragment of the correct size of 1 .5 kb was directionally inserted in the 
expression vector pETlOl-D/Topo. The inserts of several clones were sequenced using primers 
JH#5-9. One correct clone was named pJH402. 

25 For amplification of UGT89B1 for use in the E. coli vector pKK223-3, the oligonucletides 

JH#10 and JH#1 1 (Table 2) were used, with genomic Arabidopsis thaliana DNA (var. Columbia 
Col-0) as template for the reaction. The reaction conditions employed were 94 °C, 2 min., 1 
cycle, 94 °C, 30 sec, gradient 50-60 °C, 1 min, 72 °C, 2 min., 30 cycles, followed by 72 °C, 7 
min., 1 cycle. Reaction samples containing a fragment of the expected size of 1.4 kb were 

30 combined, and used for ligation into the pCR-Blunt II-Topo vector. Clones containing EcoRI 
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inserts of 1 .4 kb were sequenced with the primers JH# 14-1 8. A clone with a correct DNA 
sequence was identified and named pJH462. The UGT89B1 ORF was liberated from pJH462 
with EcdRI-HindR (these restriction sites were included at the 5'-termini of the primers) and 
inserted in an EcoW-HintiHL digested pKK223 -3 plasmid. A clone with a correct UGT89B1 
5 insert was identified and named pJH463. 

For amplification of UGT89B1 for use in the R coli vector pETlOl -D/Topo, an identical PGR 
reaction was performed, only using the primers JH#12 and 13. The same PCR program was 
employed, and a fragment of the correct size of 1 .4 kb was directionally inserted in the 
10 expression vector pETl 01-D/Topo. The inserts of several clones were sequenced using primers 
JH#14-18. One correct clone was named pJH406. 

For amplification of AS for use in the E. coli vector pKK223-3, the oligonucletides JH#19 and 
JH#20 (Table 2) were used, with the plasmid pQE60-AS (Arend et ah, 2001, Biotechnol Bioeng 

15 76: 126) as template for the reaction. The reaction conditions employed were 94 °C, 2 min., 1 
cycle, 94 °C, 30 sec, gradient 48-55 °C, 1 min., 72 °C, 2 min., 30 cycles, foDowedby 72 °C, 7 
min., 1 cycle. Reaction samples containing a fragment of the expected size of 1.4 kb were 
combined, and used for ligation into the pCR-Blunt H-Topovector. Clones containing EcdRI 
inserts of 1.5 kb were sequenced with the primers JH#23-27. A clone with a correct DNA 

20 sequence was identified and named pJH409. The AS ORF was liberated from pJH409 with 
EcdRI (EcdRI sites were included at the S'-termini of both primers) and inserted in an EcdRI 
digested pKK223-3 plasmid A clone with a correct AS insert in the correct orientation was 
identified and named pJH41 0. 

25 For amplification of AS for use in the E. coli vector pETlOl -D/Topo, an identical PCR reaction 
was performed, only using the primers JH#21 and 22. The same PGR program was employed, 
and a fragment of the correct size of 1.4 kb was directionally inserted in the expression vector 
pETl 01 -D/Topo. The inserts of several clones were sequenced using primers JH#23-27. One 
correct clone was named pJH41 1 . 

30 

UGT85B1. UGT89B1 and AS for constitutive expression in E. coli 
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A series of tailored expression systems for UGT85B1 and AS were constructed in which the 
glucosyl transferase genes were controlled by one of four glycolytic E. coli gene promoters: Jba 
(primers JH#28 and 29),pJkA (JH#30 and 31), tpiA (JH# 32 and 33) and gapA (JH#34 and 35) 
promoters, and all by the E. coli cysE terminator (primers JH#27 and 28) sequences. All 
5 constructions were based on the following scheme: The promoter fragment was PCR amplified 
from E. coli DH5a genomic DNA, inserted BamHL-AscI (these sites were present in the primers) 
in pKOL30 (Olesen et aL, 2000, Yeast 16: 1035). In the resulting construct the cysB fragment 
was then inserted Ascl-HindUL (sites present in primers). Finally, in the resulting construct, PCR 
amplified UGT85B1 (primers JH#38 and 39), UGT89B1 (primers JH#40 and 41) or AS (JH# 42 
1 0 and 43) were then inserted Xbal-EcciRI (these sites were present inside the AscI sites in the 3 *- 
end primer used for promoter amplification and in the 5 '-end primer used for the terminator 
amplification, as well as in the primers used for amplification of the UGT and AS genes). 

The PCR reaction conditions employed were 94 °C, 2 min., 1 cycle, 94 °C, 30 sec., 55 °C, 1 min. 
1 5 (for promoter and terminator fragments, as well as UGT fragments) or 48 °C, 1 min. (for AS 
fragment), 72 °C, 2 min., 30 cycles, followed by 72 °C, 7 min., 1 cycle. For amplification of 
UGT85B1, 5 % DMSO was included. Reaction samples containing a fragment of the expected 
sizes were combined, gel purified and used for cloning experiments as described above. 

20 The resulting plasmids were pJH430(/6a-UGT85Bl),pJH431 fe/3fcA-UGT85Bl), 

pJH432(gap^A-UGT85Bl), pJH433 (/p*A-UGT85Bl), pJH434 (/Z>a-UGT89B1), pJH435 
(#fcA-UGT89Bl), pJH436 (ga/>A-IJGT89Bl), p JH437 (fc?/A-UGT89Bl), pJH438 (/&a-AS), 
pJH439 fo/fcA-AS), pJH440 feapA-AS) andpJH441 (^pfA-AS). 

25 EXAMPLE 11: Vanillin detoxification by expression of UGT or AS genes E. coli 

The E. coli strains TOP10, JM109, and K01418 were all transformed with the constructed 
plasmids pJH401, pJH402, pJH411 and pJH412, andpKK223-3 as control, employing standard 
transformation protocols. Transformants were selected for ampicilin resistance, and two 
30 transformants of each E. coli strain with each plasmid were kept for expression experiments. The 
E. coli transformants were inoculated from plate cultures into 2 ml liquid growth medium (LB 
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medium containing ampicilin at 100 jig/ml), and allowed to grow for 20 hours at 37 °C. The 
precultures were diluted 100 times 10 4 times and 10 6 times in growth medium, and 4pl droplels 
of these cell suspensions were applied to the surface of Petri dishes containing solid LB-ampicilin 
medium containing varying concentrations of vanillin (based on the STV determinations 
5 described above), and without or with (to induce gene expression) 1 mM isopropyl 

thiogalactoside (EPTG). The plates were incubated at 37 °C for 24 hours, after which growth on 
the various concentrations of vanillin was monitored and recorded. The results are summarized 
in Table 1. 



E. cott strain \ % Vanillin 


0% 


0.08 % 0.12 % 0.16 % 


«. « 1PTG 
0.2 % 

induction 


JM109 |pKK223-3] 


++ 
++ 


++ ++ 
+4- -H- 


No 
Yes 


JM109 [pJH401] 


++ 
++ 


++ 4+ 
++ ++ 


No 
Yes 


JM109 [pJH402] 


++ 
++ 


-H- ++ + 
++++++ 


No 
Yes 


JM109 [pJH410] 


++ 
++ 


++++++ 
++ ++ + 


No 
Yes 


K01418 [pKK223-3] 


++ 
++ 


++ ++ 

++ ++ ++ 


No 
Yes 


KO1418tpJH401] 


++ 
++ 


++ ++ 
++++++ 


No 

• Yes 


KO1418[pJH402] 


++ 
++ 


++ ++ + 
++++++ 


No 
Yes 


K01418 OpJH410] 


++ 
++ 


++ ++ ++ 
++ ++ . ++ 


No 

++ Yes 



10 



Table 1. Effect of EPTG-induced expression of S. bicolor UGT85B1 or R. serpentina AS on the 
toxicity of vanillin on three different strains of E. coll -: No growth; +: weak growth; ++: good 
growth. 
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From these experiments we conclude that expression of S. bicolor UGT85B1 is possible in some 
E. coli strains and with some types of IPTG-inducible gene promoters, and that a stronger 
vanillin detoxification can be obtained by expression of the AS gene, but only in an E. coli strain 
in which the phospho-fj-glucosidase encoding bgl locus has been inactivated (strain K0141 8). 

5 

For experimentation with constitutive expression of UGT85B1 and AS, the E. coli strains 
JM109, and K0141 8 were transformed with the constructed plasmids pJH 430, pJH43 1, 
pJH432, pJH433, pJH434, pJH435, pJH436, pJH437, pJH438, pJH439, pJH440, and pJH441, 
and pKOL30 as control, employing standard transformation protocols. Transformants were 

10 selected for ampicilin resistance, and two transformants of each E. coli strain with each plasmid 
were kept for expression experiments. The E. coli transformants were inoculated from plate 
cultures into 2 ml liquid growth medium (LB medium containing ampicilin at 100 jig/ml), and 
allowed to grow for 20 hours at 37 °C. The precultures were diluted 100 times 10 4 times and 10 6 
times in growth medium, and 4pJ droplets of these cell suspensions were applied to the surface 

15 of Petri dishes containing solid LB-ampicilin medium containing varying concentrations of 
vanillin (based on the STV determinations described above). The plates were incubated at 37 °C 
for 24 hours, after which growth on the various concentrations of vanillin was monitored and 
recorded. The results are summarized in Table 2. The results with E. coli strains JM109 and 
K01418 with the plasmids pJH430, pJH431, pJH432 and pJH433 are summarized in Table 2 

20 below. 



E. coli strain \ % Vanillin 


0% 0.08% 0.12% 0.16% 


JM109 [pKOL30] 


1 1 1 III* H~~H - 


JM109 [pJH430] 


+++ +++ +++ ++ 


JM109 [pJH431] 


+++ +++ +++ ++ 


JM109 [pJH432] 


■J* I" V f 1 1 1 1 1 ■ I + 


JM109 [pJH433] 


+-H- +++ -H-+ + 


K01418 [pKOL30] 


+++ +++ -H- + 
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K01418 [pJH430] 


+++ +-H- +++ +++ 


K01418 [pJH431] 


+++ +-H- +++ +++ 


K01418 [pJH432] 


+++ +++ +++ + 


K01418 [pJH433] 


Ml 1 1 1 1 1 1 I 1 1 



Table 2. Effect of constitutive expression of S. bicolor UGT85B1 or R. serpentina AS on 
the toxicity of vanillin on two different strains of E. colL No growth; +: weak growth; 
++: growth; +-H-: strong growth. 

5 

From these experiments we conclude that constitutive expression of glucosyl transferase genes in 
E. coli strain JM109 results in vanillin detoxification. 

EXAMPLE 12: Vanillin glucoside (VG) production in UGT85B1-, UGT89B1- or AS-expressing 
10 strains of E. coli 

Several E. coli strains expressing UGT85B1 or AS are tested for VG production by growth in 
liquid growth medium containing sublethal vanillin concentrations. The strains tested are JM109 
and K0141 8 containing either of the following plasmids: pKK223-3, pJH401 , pJH402, pJH406, 
15 pJH463, pJH410, pKOL30, pJH430, pJH431, pJH432, pJH433, pJH434, pJH435, pJH436, 
pJH437, pJH438, pJH439, pJH440 andpJH441. 

The E. coli strains are inoculated over night at 37 °C in 2 ml LB medium containing 100 [ig/ml 
ampicilin, and then diluted into 50 ml of the same growth medium (0.1 ml o.n. culture). Growth 

20 proceeds at 37 °C for 2 hours, and then IPTG is added to a concentration of 1 mM (0.5 ml 100 
mM stock) to those strains containing EPTG-inducible UGT/AS-expression plasmids. The cell 
suspensions are then incubated at 28 °C for 1 hour, after which vanillin is added to a 
concentration of 0.05 % (83 pi 30 % ethanol stock). Growth then proceeds at 28 °C for 24 
hours. Cultures are centrifizged LC-MS are employed to determine VG content in growth 

25 supernatant as well as lysates of cell pellets. 
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In another set of experiments, the same E. coli strains are grown in a similar way, but instead of 
vanillin, 14 C-labelled vanillin (5 mCi of 50 mCi/mol) is added to the growing cultures, after which 
growth proceeds 28 °C for 1-24 hours. Supernatant samples are taken regularly as are cell 
5 samples. Cell lysates are produced and both growth supernatants and cell lysates are subjected to 
TLC (thin layer chromatography) as described (Jones et al., 1999, J Biol Chem 274: 35483). 
Employing autoradiography, radioactive vanillin and radioactive vanillin glucoside are detected 

By both of the above described analyses we are able to detect the presence of vanillin glucoside 
10 from cells expressing UGT or AS genes while no vanillin glucoside can be detected from growth 
of cells not expressing UGT or AS genes. 

EXAMPLE 13: Establishment of a de novo biosynthesis pathway for the supply of vanillin to 
microbial expressed glucosytransferases 

15 

To establish a microbial pathway for the conversion of glucose to vanillin three heterologous 
enzymatic activities are expressed in the UDPG-glucosyltransferase expressing microorganism. 

The first enzymatic activity is a 3-dehydroshikimate dehydratase, which <c taps" out a distant 
20 vanillin precursor from the common aromatic amino acid biosynthetic pathway, namely 3- 

dehydroshikimate, and converts it to protocatechuic acid. 3-Dehydroshikimate dehydratase. 

(3DHD) genes are known from Neurospora crassa (Ruthledge, 1984, Gene 32: 275), 

Aspergillus nidulans (Hawkins et al., 1985, Curr Genet 9: 305), Podospora pauciseta (GenBank 

accession # AL627362) zsA Acinetobactercalcoaceticus (Elsemore & Qrnston, 1995, J 
25 Bacteriol 177: 5971). The next enzyme necessary is an aromatic carboxylic acid reductase 
. (AGAR) that can convert protocatechuic acid to protocatechualdehyde. ATP-dependent ACAR 
. activities are found in certain actinomycetes {Nocardia sp.) (use of Nocardia ACAR patented: 
• Rosazza & Lv2001, US 6261814B1), in Neurospora crassa (Gross & Zenk, 1969, Eur J 

Biochem 8: 413) (use of ACAR enzyme preparation patented: Frost, 2002,, US 6372461B1) and 
30 in a range of basidiomycetes, socalled "white-rof * fungi, which are lignin degraders (e.g. 

Trametes, Dichpmitus, Bjerkandera and Pleurotus ("Oyster mushroom") sp. (Hage et al., 1999, 



Ourref. 1068EP01 



49 

Appl Microbiol Biotechnol 52: 834). No data on AARC protein or nucleotide sequence are 
available, except for a short N-tenninal protein sequence from the Nocardia enzyme (Li & 
Rosazza, 1997, J Bacteriol 179: 3482). Finally, a 3-0-methylation is necessary for the final 
conversion to vanillin. The candidate enzyme for this purpuse is a strawberry S- 
5 adenosylmethionine:furaneol O-methyltransferase gene (FaOMT) (Wein et al., 2002, Plant J 3 1 : 
755), as this enzyme efficiently methylates vanillin at the 3-0. Alternative methylases include 
aspen (Bugos et al., 1991), Plant Mol Biol 17: 1203), almond (Garcia-Mas et al., 1995, Plant 
Phys 108:1341) and Vanilla planifolia (Xue & Brodelius, 1998, Plant Physiol Biochem 36: 779) 
caffeic acid OMTs. To avoid the accumulation of toxic intermediates the first gene expressed in 
1 0 a functional UGT-expressing microbial strain will be the methyl transferase gene. When this has 
been seen to work (i.e. producing vanillin ghicoside from protocatechualdehyde), the aldehyde 
dehydrogenase is introduced, etc., until the intrinsic 3 -dehydroshikimate-producing pathway is 
reached. 

1 5 The strawberry FaOMT gene is isolated by PCR, using Fragraria x ananassa (strawberry) 
cDNA as template DNA. The cDNA is isolated from maturing R ananassa var. Elanta using the 
QBiogene Fastprep Pro Green kit, and from the resulting total RNA preparation cDNA is 
synthesized employing the Invitrogen Superscript II Reverse transcriptase. This cDNA 
preparation is used for PCR amplification using FaOMT ORF-specific primers and Pwo 

20 polymerase (Roche Biochemicals). The resulting 1 .1 kb FaOMT-containing DNA fragment is 
isolated and inserted between the Xbal and BamHL sites in the inducible E. coli expression 
vector pJH-Xl, resulting in plasmid pJH-X2. 

E. coli strain JH1, which corresponds to E. coli JM109 containing plasmid pJH430 (expressing 
25 UGT85B1 from the E. coli fba promoter) is transformed with plasmid pJH-X2. The resulting E. 
coli strain JH2 is grown in liquid LB growth medium to which protocatechualdehyde is added. 
Expression is induced. The culture supernatant of the outgrown culture is subjected to LC-MS, 
and vanillin glucoside is identified, meaning that a biosynthetic pathway that can convert 
protocatechualdehyde to vanillin glucoside has been established. 



30 
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An AARC gene that is effective in the conversion of protocatechualdehyde to vanillin is isolated 
in the following manner: cDNA is synthesized from total RNA extracted from a white rot fungus 
(chosen from the species Pleurotus ostreatus, Trametes gibbosa or Dichomitus squalenes) by 
the use of Invitrogen Superscript II Reverse transcriptase. A cDNA library is produced using the 
5 Clontech Creator Smart cDNA fibrary construction system. The fungal cDNA inserts are 
transferred to the E. coli expression vector pJH-Xl -Cre, which has been modified from vector 
pJH-Xl by the use of the Clontech Creator Acceptor Vector Construction system. A number of 
20.000 clones from the resulting cDNA expression library are grown in liquid LB growth 
medium in microtiter plates (209 plates in total). Cell extracts from these clones are inoculates 

10 with protocatechuic acid, ATP and NADPH, and samples from the enzymatic assays are 
subjected to LC-MS analysis, in pools of 96 clones. An enzymatic activity that results in the 
conversion of protocatechuic acid to protocatechuic aldehyde is identified in one pool. The 
original cDNA expression clones corresponding to the 96 enzyme assays from this pool are 
grown again, and the assay procedure repeated. One clone is shown to contain a cDNA that 

15 when expressed leads to an enzymatic activity that can convert protocatechuic acid to 
protocatechuic aldehyde. The cDNA insert of this clone is subjected to DNA sequencing 
analysis, and after identification of this AARC gene, the gene is PCR amplified again in order to 
be able to insert it into the E. coli expression vector pJH-X2, resulting in plasmid pJH-X3. 

20 E. coli strain JH1 (expressing UGT85B1) is transformed with plasmid pJH-X3. The resulting E. 
coli strain JH3 is grown in liquid LB growth medium to which protocatechuic acid is added. 
Expression is induced The culture supernatant of the outgrown culture is subjected to LC-MS, 
and vanillin glucoside is identified, meaning that a biosynthetic pathway that can convert 
protocatechuic acid to vanillin glucoside has been established 

25 

A 3DHD gene is isolated from Podospora pauciseta in the following manner Genomic DNA is 
isolated from P, pauciseta using the QBiogene Fastprep DNA system, and the 3DHD gene is 
PCR amplified using this genomic DNA as template from the reaction with P. pauciseta 3DHD- 
specific primers #3 and #4 (Table 4), and the Pwo polymerase (Roche Biochemicals). The 
30 resulting 1 . 1 kb 3DHD gene fragment is inserted between the BamHL and Xbal sites of the E. 
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coli expression vector pJH-X3, resulting in plasmid pJH-Van. E. coli strain JH1 (expressing 
UGT85B1) is transformed with plasmid pJH-Van. 

The resulting E. coli strain JH4 is grown in liquid LB growth medium. Expression is induced 
5 The culture supernatant of the outgrown culture is subjected to LC-MS, and vanillin glucoside is 
identified, meaning that a biosynthetic pathway that can convert glucose to vanillin glucoside has 
been established. 

E. coli strain JM109 is now transformed with plasmid pJH-Van. The resulting strain, JH5 now . 
1 0 contains a biosynthesis pathway for vanillin production from glucose, while strain JEW contains a 
biosynthesis pathway for vanillin production in addition to the glucosyl transferase-encoding 
UGT85B1 gene. Strains JH4 and JH5 are both grown in liquid LB growth medium, and gene 
expression is induced By the means described above we compare vanillin production in strain 
JH5 with vanillin glucoside production in strain JH4. 

15 

We observe that the E. coli strain containing a biosynthesis pathway for vanillin production in 
addition to a glucosyl transferase gene (strain JH4) produces higher amounts (on a molar basis) 
of vanillin glucoside as compared to the amounts of the corresponding vanillin aglucone. 
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CLAIMS 

1. A method of producing a low molecular weight organic aglycon compound comprising 
following steps: 

5 a) growing a cell, which comprises a gene encoding a product involved in the 

biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 
glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 
produced aglycon, under suitable conditions wherein the cell produces the aglycon and 
the corresponding glycosylated form of the aglycon; 
10 b) deglycosylating the glycosylated form of the aglycon; and 

c) recovering the aglycon compound; 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
15 a sugar to the aglycon compound 

2. The method of claim 1, wherein the cell is a plant cell. 

3. The method of claim 2, wherein the cell is a microorganism cell or a filamentous fungi cell. 

20 

4. The method of claim 3, wherein the microorganism cell is an E. coli cell. 

5. The method of any of the preceding claims, wherein the glycosyltransferase gene is a 
heterologous glycosyltransferase gene. 

25 

6. The method of any of the preceding claims, wherein the glycosyltransferase is an UDPG- 
glycosyltransferase, preferably an UDPG-glucosyltransferase. 

7. The method of any of the preceding claims, wherein the low molecular weight organic aglycon 
30 compound has (i) a molecular weight from 50 to 1000, (ii) is an alcohol, in particular an 

aromatic alcohol and (iii) a secondary metabolite compound. 
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8. The method of claim 7, wherein secondary metabolite compound is a plant secondary 
metabolite compound selected from the group consisting of: 

• Terpenoids 

• Alkaloids 

5 • Phenylpropanoids 

• Phenyl derivatives 

• Hexanol derivatives 

• Flavonoids 

• Coumarins, stilbenes 
10 • Cyanohydrins 

• and Glucosinolates. 

9. The method of claim 8, wherein the plant secondary metabolite organic aglycon compound is 
vanillin. 

15 

10. The method of any of the preceding claims, wherein the deglycosylating step b) of claim 1 
takes place outside the growing cell following excretion or extraction of the in step a) produced 
glycosylated form of the aglycon and wherein the deglycosylating is an enzymatic process 
mediated by a beta-glucosidase. 
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ABSTRACT 

A method of producing a low molecular weight organic compound (e.g. a plant or bacteria 
secondary metabolite) in increased yields involving use of a cell (e.g. a plant or microorganism 
5 cell), which comprises a gene involved in the biosynthesis pathway leading to a low molecular 
weight organic aglycon compound and a glycosyltransferase gene capable of glycosylating the 
produced aglycon. 
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CLAIMS 

1. A method of producing a low molecular weight organic compound comprising following 
steps: 

5 a) fermenting a microorganism cell or a filamentous fungi cell, which comprises a gene 

encoding a product involved in the biosynthetic pathway leading to a low molecular 
weight organic aglycon compound and a heterologous glycosyltransferase gene encoding 
a glycosyltransferase capable of glycosylating the produced aglycon, in a suitable 
medium, wherein the cell produces the aglycon and the corresponding glycosylated form 

10 of the aglycon; and 

b) recovering the glycosylated form of the aglycon compound; 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
weight from 50 to 3000 g/mol, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
15 a sugar to the aglycon compound, wherein the sugar is a sugar selected from the 

group consisting of galactose, glucosamine, N-acetylglucosamine, xylose, 
glucuronic acid, rhamnose and glucose. 

2. A microorganism cell or a filamentous fungi cell that comprises a gene encoding a product 
20 involved in the biosynthetic pathway leading to a low molecular weight organic aglycon 

compound and a heterologous glycosyltransferase gene encoding a glycosyltransferase capable 
of glycosylating the produced aglycon, wherein the cell, when fermented in a suitable medium, 
produces the aglycon and the corresponding glycosylated form of the aglycon, and wherein the 
glycosyltransferase is a glycosyltransferase capable of conjugating a sugar to the aglycon 
25 compound, wherein the sugar is a sugar selected from the group consisting of galactose, 
glucosamine, N-acetylglucosamine, xylose, glucuronic acid, rhamnose and glucose. 

3. A method of producing a low molecular weight organic aglycon compound comprising 
following steps: 

30 a) growing a cell, which comprises a gene encoding a product involved in the 

biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 
glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 
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produced aglycon, under suitable conditions wherein the cell produces the aglycon and 
the corresponding glycosylated form of the aglycon; 

b) deglycosylating the glycosylated form of the aglycon; and 

c) recovering the aglycon compound; 

5 (i) wherein the low molecular weight organic aglycon compound has a molecular 

weight from 50 to 3000, and 

(ii) wherein the glycosyltransferase is a glycosyltransferase capable of conjugating 
a sugar to the aglycon compound 

10 4. A method for selecting a cell with increased production of a glycosylated form of a low 
molecular weight organic aglycon compound comprising following steps: 

a) growing a cell, which comprises a gene encoding a product involved in the 
biosynthesis pathway leading to a low molecular weight organic aglycon compound and a 
glycosyltransferase gene encoding a glycosyltransferase capable of glycosylating the 

1 5 produced aglycon, under suitable conditions wherein the cell produces the aglycon and 

the corresponding glycosylated form of the aglycon; 

b) treating the cell in a way that changes the expression level of at least one gene 
involved in the biosynthesis pathway leading to a low molecular weight organic aglycon 
and/or the glycosyltransferase gene capable of glycosylating the produced aglycon in 

20 order to make a library of cells with different expression levels of the genes; and 

c) selecting a cell that produces a higher amount of the glycosylated form of the aglycon 
as compared to the cell of step a); 

(i) wherein the low molecular weight organic aglycon compound has a molecular 
weight from 50 to 3000, and 
25 (iii) wherein the glycosyltransferase is a glycosyltransferase capable of 

conjugating a sugar to the aglycon compound 

5. The method of claim 3 or 4, wherein the cell is a plant cell. 

30 6. The method of claim 3 or 4, wherein the cell is a microorganism cell or a filamentous fungi 
cell. . 
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71 The method of any of claims 1 and 6 or the microorganism cell of claim 2, wherein the 
microorganism cell is an E. coli cell. 

8. The method of any of the claims 3-7, wherein the glycosyltransferase gene is a heterologous 
5 glycosyltransferase gene. 

9. The method claims or microorganism cell claims of any of the preceding claims, wherein low 
molecular weight organic aglycon compound has (i) a molecular weight from 50 to 1000, (ii) is 
an alcohol, in particular an aromatic alcohol and (iii) a secondary metabolite compound. 

10 

10. The method claims or microorganism cell claims of any of the preceding claims, wherein the 
glycosyltransferase is an UDPG-glycosyltransferase, preferably an UDPG-glucosyltransferase. 

11. The method of any of claims 3 and 540, wherein the deglycosylating step b) of claim 3 takes 
1 5 place outside the growing cell following excretion or extraction of the in step a) produced 

glycosylated form of the aglycon and wherein the deglycosylating is an enzymatic process 
mediated by a beta-glucosidase. 
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ABSTRACT 

A method of producing a low molecular weight organic compound (e.g. a plant or bacteria 
secondary metabolite) in increased yields involving use of a cell (e.g. a plant or microorganism 
5 cell), which comprises a gene involved in the biosynthesis pathway leading to a low molecular 
weight organic aglycon compound and a glycosyltransferase gene capable of glycosylating the 
produced aglycon. 



